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RANDOM AND SYSTEM AT If' AI IK A NO EM ENTS 

Rv IIAHOLD JEFFREYS, F.R.S. 


I it \vr followed with interest flit* recent dtKcmHiuuH of thin question, though 1 
cannot rlrtim {MTHoiiiJ knowledge of (hi* types of problem when* the proponed 
arrangements an* used. My j<er*utnil feeling, as a KeDmologiist dealing with 
natural rurlhqimkes, i* one of envy of workers in eidijeef,* where it in possible 
trt design rxjterunenl.* at all. ami espeeiully where they can hi* designed in such 
;t way |ha! the unrmnl mpi/ilions fir the various pom meters to ha estimated trill 
i'Hiiiain no t-rnttH terms. Nevertheless, 1 think that some point* concerning 
exjsrimentnl design an* in damier «*i" Mug »»veil min'd even hi the fortunate 
subjects where <J»*i-ign i« pmedble. In particular, the word “m»»It>iniit*K»’‘ appears 
to he used in t«n ■•ruse.,, which art' »<<f equivalent. To take itt) illustration from 
ci'-mic **nrvry, the espesum-nt w.mid run«ist in the discharge of an explosion 
a! a kit*ran pend and the recording «*l the times of »r«n«iHHf*ion of elastic waves 
through lit" gr*<nnd to n set *4 recorder* at known distances. The choice of the 
disfomea is pur) of the design of the experiment. tin* problem » to find a pair of 
puraiuetem n and h m ucewatoly a« pimmide Loin oltwcrvntiotiK of the times t, at 
de-tame* x, Tin* will he done by the method of least squares, the uquntinna of 
enitditimi h-iiiit a ^ ), X ' .^ 


Now the * joe te.n of design will he, h<*w should the arbitrary dwtancoa x r hr 
rh«*mi! Itt pr.e ti*tlm inter vnE ore mm.dh made an nearly equal »a jHiasible, 
so that there bur high degree ot a\>tem m the design Hut un advocate of extreme 
randomness in design would apparently all the distances hy Tippett’s 

numbers or some analogous device, A possible cmtwqmtncs of this would he. 
that they would id! !«• nearly equal, and it would he imposaihle to determine 
either >i or ft without external evidence con»a*rmng tin* value of the other. Or they 
might all he euHiviit rated near s w ■* valm-a, then *i and h mtild be determined, but 
there would Ik* no mean* >4 jesting wise! h**r tin* time of framsmiatiun is adequately 
rcpremilpd by « linear formula there «rr nun** m prm-tjee where a square or a 
cube term needs to fir* included, though the linear form is usually adequate. Thus 
randomness in d«*#igu may »t» ***>me rirnmnttamw lead to the low of information 
that would W provided hy a -systematic design. 

itu the other hand it i» claimed that a systeniatie design may lead to an 
underestimate >.f the uncertainty, and it is here that the ambiguity of the use 
nf “ random lire*'' enters, The validity u f the method of leant squares rt*wt« on the 
hvj«»fhms «f randomness in the mt®< that jwrti meters« and b exist such that, 
given their values and the standard error, the probabilities of the residuals at all 
the distant** used are indi'i«mlent. Tin* may he true without the distances 
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Random and HyMrmatk Ananyrtarnt* 

ness of design will then permit several observation* »« to war tto w mm**, 
mi the probabilities of the residuals, given t he dwUnrew. v> tU w A to wlepwlent 
If in an actual caw.*, the variation of level wiudmnuom«\ and the reorder* were at 
uniform intervals so as always to hr on the create. there would to * *y*ii*m*t»«* 
error in the least squares solution; Imt this ran in this problem to calculated uwS 
allowed for. In practice, however, variations of level are usually «ud 

independence of the errors is heat achieved by using uniform interval* m m to 
minimize the increase of uncertainly due to jHwrihlc wrretoitw between 
neighbouring distances. This dews not exclude t he jwsihility t hat n closer apprem 
mation might be attempted by including it term proportional to the bright- in 
the law assumed; but that would still not eliminate pi wo bio effect* of hwnl 
variation in the velocities immediately below the recording twtrumeiite, and 
the systematic design would still approximate to the condition of ntdejwndetiw 
of the errors better than the random one would. 

My excuse for mentioning this problem in a mainly biological journal ta that 
it is one that I am fairly familiar with, anti that it seems to illustrate two points 
that have not been adequately analysed in the biological literature. The first w 
the absolute necessity of distinguishing totwmi the prntmhihfi"* .4 the «>we 
proposition assessed cm different data. Before designing tin* alum* experiment 
we may have only the vague knowledge of n and h suggested by rimiter expert- 
ments elsewhere, If we are t o make a systematic design we do know «»mrtl»*ng 
about what the relations between the intervals wilt be; if wo are to make a ramtem 
one we do not. But in either case, as soon as the experiment i* designed, the 
distanoeB are definitely known in both cases, and are not random any longer 
The knowledge of the possible types of design that might have lawn adopted im no 
longer relevant in either case, since we know the particular design that hem torn 
adopted; it is now part of the data of the problem, Tim estimates will in cither 
case be made using the actual distances b» data, not wane aggregate *»f fire 
possible distances that might have occurred with that method of rimgn Tim 
hypothesis of the randomness of the residuals, which t# needed for the validity »f 
the method of least squares, has nothing to do. intrinrirally, with the intended 
randomness of the original design; in this cm> the latter nutdutntMM would 
in practice often vitiate the validity of the former, while in any earn* iitmmditg 
the uncertainty of the estimates. 

The second point is that any method of estimation premippoMw a. tew of error 
or chance containing adjustable parameters, such that if these were known tins 
probabilities of various experimental results am assigned-Tn other words, that 
the likelihood, given these parameters, has a definite value. How 1 maintain 
t &t this is always what logicians call a considered proposition, not m <sm«rkd 
one Ihere is no proof in any actual case that no parameters other than than m 
tar thought of will ever be required; any theory of significance teste contemplate* 

e possi i y t at others may be needed. But if a new parameter, whose value 
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is n*4 vf-t kjinw jt, may Itc relevant, then the law of error, given those already 
cumodeml, is not unique and the likelihood has not a definite value. In the 
mstmt* survey, we do not assert that the relation' between distance and time of 
travel i* linear; we merely consider it and use it until we find some definite 
evidence against if in a particular ram-. But we need the information that will 
find out if it in wrong; and thin must he done by making sun that the presence 
of higher jaiwete ran he tested. This i» done by the systematic design. The random 
design would leave it to chance, and abandon the attempt to test the law if it 
should give an arrangement with all the distances concentrated near two values. 
It would be pleasant to he able to say that we already know what parameters 
can Iw relevant; but we have to face the fact that we do not. What wo can do is to 
provide a procedure for testing them as they arise; hut this necessarily implies 
that we must proceed from the hypothesis involving fewer adjustable parameters 
to the one involving more. The hypothesis that a limited number of parameters 
is adequate must be mmitlrml in any case, but that is not the same as saying 
that no others will ever bo needed. Estimates, however, are always subject to 
the hypothesis that no parameters other than those explicitly considered are 
relevant There appears to lie no escajH; from this dilemma; we can only admit it, 
state the hypothesis explicitly, and say that the results are the best that we can 
give in the actual state of our knowledge at the time, f Confidence in it will depend 
mainly on the failure to find evidence for other parameters that might conceivably 
have been relevant to the observations already available, 

Isrt us now consider how this may apply to problems of sampling a population 
and the design of agricultural experiments. In t he former ease the population 
consists of several different types, of numbers i>„ a,,.... e ; ,, total >>. The numbers 
of these, types in our sample are n lt n s , total n. It is notin our power to 
change the ly. and practical considerations may fix n. But n v n v ... are at our 
disposal. The question is, how should they he chosen ? Neyraan (1934) has shown, 
that if the standard errors of the respective types are (t y ,(t^ ..., and the means 
fount! for the types are x n then provided that the iy are known the most accurate 
estimate of the moan for the entire population would he got by taking the n r 
in proportion to v r <r r , atid the estimate as ^v r xjv, If all the<r f are equal, the n r 
should be taken in proportion to the v r , and the best estimate of the mean for the 
imputation will be the mean over the sample, £n r x r !n, irrespective of the differ¬ 
ences bet ween different types. The standard error also is independent of those 
differences. It reduces simply to a matter of estimating the mean for each type 
from a sam ple of that type, and its uncertainty is determined only by the variation 
within types; and the mean for the whole population is the mean of the type 
means, weighted in proportion to the v rl which are in proportion to the n r . 
Now this is a highly systematic method of sampling. The random method would 
be to choose a individuals from the entire population by selecting from a directory 
by some set of "random numbers". The numbers of the types in the sample 
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would then depart from prnp»rtK*u»htv.* r7 ' T ' 
a U( l ifthemmn.»fth««MnpU* i« t »ken »«tbcr*s,nw^4 * w «■’•*» 1 1’' iia1 ’’ ’* 

it will have an additional ctT»r««namnMrf **f *h«* if m 

of the type differences and the Munphug em*r« It v« that tV 
error of the result «* obtained if «-* Trebly mhmnUA ln-m *■!«' «."**» 5 d ! » ,r 
variation within the sample iw a wk«h-. hwt if *tB 1* l*rit« than that -4 a- 
mean of a projiortional «mph* H instead «»f taking the mean »d th»* jm 

our estimate of the mean of the population. thn# tngh'my the tyj»- reran* «•> 
cording to n r . we weight them according ?«* » r . »n re** 1 ’'*’*' an iwjimnre th*t 
is independent of the variation of the ?\j*c uevm* »nd fb«‘re*-"re ha* a !-»^f.? 
accuracy, approaching that for the pr<>p«rfreiiid **nq h> 

To apply this met hod it in nrmwarv that th** >, i*n- ah t >** hi.--at it » ••» y 
unknown the only way uf estimating ihem would 1** by * t«t.. t--u. •; <• 
from the, whole population, anil their tnoel ptolwble value* would f m m S In- r ^ 
of the it, Then the mean of the «unj>l«* w«nM l*r* the tayt **.*tim*t* k| th«* 
latitm mean, and it« standard error would tr correctly i> «nd <r*.m ftu 1 Aro»i. n 
in the whole sample. The point, in that information about t|.«’t. c> t*dm ?- the 
estimate of the population mean and its standard om.r, g»«**n to* ««..!• jj • « i 

this will be our actual position, since onr problem n* in pr .»•!!• >' f»- '*..»» t v«>v rere-w 
about the population given the sample, if »«* d<* re.r h tm> j-v/pulatren 
numbers apart from what the sample can tell us atmiu fh«-m th«T" h w* more h* 
be said; but if we do know them awl omit to mm ih«un »*• »n« <iiv«r>tmg valuable 
information and introducing avoidable error into tjm whreare <4 she wean Id ■ 
conclusions drawn from a given sample are m«t the same if t J»r » , are aW> p-»n .• ,* 
the data as they are if the jy are unknown. 

On the other hand it is claimed that the method <»l mndwu wmii-lr.,* n 


unbiased. Now the word “bias" also seems to l* capable «4f wm«i r>- 
tations. If the standard deviations within arc different. but fl ( « jr 
are known, it has been seen that the brat estimate will be obtained by <f<dif*m«Seh' 
introducing a departure from proportionality into the aamphog and t*if/*wmg I t 
it afterwards by weighting according to iy, It might bo *ud, not tmtourh. that 
any procedure that give# an estimate different from the hart utw n*mg tb** w huh 
of the available data is biased. It is true that random sampling at th*> untsei t* 
designed to give every possible sample of given ante an equal eh«An» of Ham# tbo 
actual sample; but when the sample has actually bt«n t*ken. that is the «an>pk< 
and the intentions of the designer are no longer relevant . If w# know that 11 » 
not in fact proportional wo are entitled to allow for the depart tire and shall gain 
accuracy by doing so. The faot that the use of the sample mean a* the cwtimau 
might give an error with either sign in a future experiment doe* not imply t!»t 
in a particular case it is equally likely to have given one with either sign. It.» 
sometimes said that by taking a random, sample m avoid dangers that might 
arise through a systematic sample nob being in proportion. Thus Yule & Kendall 
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itiduditijt h» (wtimable error that him nothing > j,>j S hef l) : e fil» 1 e xp ecli0t 
{ J»a%T lawn jm&sded hy the roeent eritieiHiiW ^ h fl j [ ia ,ye n® 

Hewliitg ’"Studenta" f«»l>er and Fisher’s 1)e ^^ Q S 
in almost complete agreement with both; w ,{ ^ p°f2 

them to sgrw with each other. On the m .yjott 08 ; g jix 0 ^ . 

<.;(iiii«n whatever, mid leave that toapecialt# 48 * ^ nora na „,n in tlJP ^, e nt9 in 

p* w rong in regarding Fisher an an advocate o e * 


’ f »y 0t<,!11 r the 

ps wrong in U flt 01 . a iie8 01 „,hefl n P 

Kwiier haa not aufficiently emphaafeed the an e yaf» b y ^occ^ 
Consider m arrangement intended to eompa . j; pro c ®® - &n the ®' 

three S x 5 »|uare». A die -hard randomi»t 51 L )t b ert ' b he Voc0 ( J u 

plate in order. I to 75, and throw a die t ill numb 'i & five,' i6 ty JO^ 

IS tirnw*. If the nth throw, neglecting «<°®* 8 ’ y o» fcft . wh eI ’ e >nt , 

plot to the fifth variety, and no on. He might be tl piff er 

with »n« aeries of random numbers, The re» u jjy he ^ co^ c0 HJp arl *! ha t 

not appear at all in one square; there wou r° ftll d r0 bah^ e ur 

game variety occurred two or three times m des^ 0 ’- v eiy ° C 

varieties would not occur equally often m , g e3C o60» e ^ch 
between them would have different weighty A y igber 
the result would Ik; three Latin squares, Vv ‘ e 
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equally often he introduces ora- systematic mdrelten. when hr ap$4w *b»* 
to each square separately he introduce* another, when he make* s? w nir jurt 
once in every row and every column he introduces others The ^m?4rte 
procedure has never, 1 think, been adopted in auch « w It m«v Jr Um «h»nr>l 
to have even been considered; hut statement of it shows h«* f*r W 

gone in the direction advocated by "Student", The latter tfetit f**-t J*nnt ««rt 
that the Latin square is both random and balanced. hut the two pteipcrriw refer 
to different features of the design. The systematic feat ures enable the maximum 
accuracy to be obtained for each variety, consistently with uniformity. Hut I 
think that some further at tention should Jtc* given to the function of the r*wt*>*re 
lying of rows and columns that in carried out after the main feature** *4 the 
Latin square design are fixed. There would In* no point in tin** if the probabilities* 
of the plot errors, given the varietal, row and column value#, w ere all indr’pmtenf 
The hypothesis required by the method of least square# would be «tn*f»wl «»d 
the analysis of variance is equivalent to the method of toast squares applied *« * 
system where the normal equations contain only diagonal term# Un the other 
hand a systematic, ground effect that upsets the indejwndwiw of the «*rr»«r% w«»uhl 
require the explicit introduction of a new parameter to express it, this would ««»t 
necessarily enter orthogonally to the others, and the analysis «*f vananre analysis 
would have to be replaced by a detailed solution by least square* Sow the 
absence of such terms cannot be guaranteed. To illustrate hinv they ran enter, 
let us suppose axes of position taken parallel to the sides, with the mitre of the 
square as origin. Then if (x, y) are the coordinates of the centre of a phi!, wtpjswp 
first that the fertility may be expressed in the form 

F = a 0 + a 1 x + 6iy + a s x s 4-6 4 »/ a f ... 4 b t y*. 

The elimination of rows and columns, in a 5 x 5 square, allows for and eliminate* 
all powers of x and y separately up to x* and t/: evidently any art of mm and 
column totals could be fitted by choosing the nine coefficient# amiably. Hut. 
neither the design of the square nor the analysis of variance technique would 
deal with a term like xy, which might he present. Whatever its cncflictent may bp* 
it will contribute nothing to the row and column total*; but iVy for all plot* of a 
particular variety will not in general vanish indeed it is possible for xy te» have 
the same sign for four of them, vanishing for the other. Thu# the pre«w of 
such a term will make a contribution to the estimate* of the varietal difference 
unless provision is made against it. Now this is not merely a, theoretical danger 
If terms in ** and y* are relevant it will be only for one particular pair of dirnffhin* 
of the sides that the coefficient of xy in the fertility will vanish, md ft mart, be 
presumed that if there is reason to attend to x s and y* we should also attend to «, 
The most accurate way of doing so would he to introduce it explicitly into the 
equations of condition and estimate it by the method of leant aquartw. In etti- 
mating the uncertainty allowance would have to be made for the fact that an 
a i lonal parameter has been estimated for every square. But since ft will not 
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orthogonally to the varietal differences the labour of calculation would be 
greatly increased. Again, if and 1 / arc relevant, there is reason to expect 

and xf to be, and m on. The analysis would become prohibitively difficult. 

>mw this is where the use of randomization comes in. In a single 5x5 Latin 
aqwamit i« jmjsamihlefor Iry to vanish for every variety. (I do not know whether 
it would he possible for larger squares, hut even if it is, higher product terms 
would require attention.) But it is possible to arrange the design so that, without 
evaluating the contributions from xy and analogous terms explicitly, they will 
contribute imlejHjndently to the totals. This is done by repeating the square, 
using separate acts of randomization for each square. It is important to notice 
that the object would not he achieved if the first square was simply copied. If 
this wa« done, the values of i'xy for each treatment in one square, and for one 
treatment in each of the others, would determine the values for all the other 
treatments in the other two squares, and the hypothesis of independence would be 
wrong. The separate randomizations ensure that, with three replications, the 
contributions of the xy and similar terms to the varietal totals are each the sum 
of three independent components. Omission to evaluate them explicitly may 
sacrifice information that could possibly be recovered, and thereby lead to loss of 
accuracy in estimating the main effects; but so far as that is a criticism it does not 
apply here to the randomization hut to the analysis of variance technique, 
which will be valid in the*© circumstances only if some artificial device is intro¬ 
duced to convert the, systematic disturbance into one that can legitimately be 
treated m random. The real justification of using this technique rather than a 
complete least squares solution, taking into account xy and possibly higher 
product terms, is practical convenience. The justification of the separate randomi¬ 
zation in relation to it is that it prevents differences from being interpreted os due 
to varietal or treatment differences when they are really due to the accumulation 
of neglected ground effects. To advocate a least squares solution for the neglected 
terms may welt be analogous to saying that an investigator should carry out all 
computations to seven figures when he knows quite well that the second will be 
uncertain in any case. 

If Deem* to mss that Fisher sums up the situation very well in his advice to 
balance or eliminate the larger systematic effects as accurately as possible and 
randomize the rest. It provides against two dangers: first, that the main object of 
the work may be lost by an imperfect estimate of the larger effects, especially 
through great departures of the normal equations from orthogonality; and 
secondly, that a neglected term, small in any one observation, may mount up 
when many are combined, But what is worth balancing, what is worth random¬ 
izing, at the cost of a small increase of uncertainty, but not worth definitely 
eliminating, and what is random anyhow, must be a matter of the particular 
problem. The only rule is to attend to actual conditions and the types of syste¬ 
matic variation likely to arise in them. I should see no point, for instance, in an 
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elaborate sampling (if people by a fixed lint of random lmmlwrn whro a to* w 
alphabetical order is available; if uniform interval in mb a M tin n«.i «»** * 
random sample in the hmiho of the independence of the errow f do not know w bs»i 
will. The only systematic variation that could bias the mean would lw # portodi 
city in terms of the position in the table, the wave-length being an r**i *ub 
multiple of the interval chosen; and this minis too remote « j«wuib»Ssty to nwl 
much consideration. But a similar difiieulty might mm in relatum to oils* 
experimental layout mentioned by “ Student ' , namely 

All BA ABB A . 

Here, as “Student" indeed remarks, the estimated difference Mwmt A and H 
would be biased if there was a lmrmonio variation of fertility peritel p* m 
odd multiple of the width of a quartet. He, dkmkw.H fbk a* a not jwrtwuforly 
likely occurrence. But land is often ridged in just this way. *md an unwary 
designer might easily lay out his experiment in audit a way that the dsffrmop 
sought could not be separated from the periodic ground effect, in much land ! 
should say that the harmonic effect is likely Ui be the dominant ground pflrct 
and that the best treatment would be to introduce it explicitly into the equation* 
of condition, and lay out the experiment so that the possible phases are uniformly 
distributed both for A and B. 

In the 5x5 Latin square, there is no logical reason w hy wo nhmihi *»top before 
an expression of possible ground effects by an expression of twenty-five term* 
■with adjustable coefficients; then there would be no mean* left for »pa,rating 
treatment effects from ground ones. Equally there seem# to W none for not 
treating the whole variation in terms of block mean, treatments, and murium 
error. The place to draw the line between these two extreme* cannot W decided 
by pure theory; it is indicated by previous experience, which has indicated that 
some types of ground effect are habitually significant, others umkmmdh 
significant, and others legitimately treated as random. What Fisher’» random¬ 
ization does is really bo combine members of the second clam in such a way that 
they may be included in the third. But extreme randomization would also' try to 
randomize the first class, and the disadvantages of this worn for to outweigh any 
possible advantage, 

However, I must now return to a subject where the normal ©quatfoiw arP 
never orthogonal; whore the normal law of error never holds; and where it k 
impossible to separate the ground effects in different regions from jmsaibfo 
systematic differences between the observers inhabiting those regions, 
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A NOTE ON NORMAL CORRELATION* 


Bv K. J. (i. PITMAN 
rnmtmh, of Tmmnia 


I s n m*M Finney f1 113#) discussed the distribution of the ratio of estimates 

of th** tint variances in a sample from it normal bi-variate population, and showed 
lent a test for tngnificanee could be applied when the population correlation 
wffirind is known. He also allowed how the test may bo adapted when only a 
,wimple mtuimle of this correlation is available, but this adaptation is not com¬ 
pletely satisfactory. This note shows how, by using a different distribution, an 
exact test for significance can be obtained, and fiducial limits for the ratio of 
the jHjjmlatinn variances determined, when the population correlation coefficient 
is unknown. 

Suppose that .r. y are normally correlated variables with probability function 


2,7(Tyr n il 



> “ n) 1 „ (x - a) [ y-b) (?/~/ ; ) a 1 ] 

. 4 w . 4 Jr 


and that x v y t {« » 1,2,..,,n) 

are n pairs of observed values of x, y, Write 

X a L'xjn, y » I'yjn, 

A'l a >S^l'(y g ~y)\ 


£(x,~x) (y t -y) 

:*,-*)* .2®!-|j*]‘ 


Since 


2(1 —/>*) 


'(*-«)* 2 (■ x~a){y-b) + (y-^) s ' 


art 


IT, IT, 


I u 8 


^ J 


1 


" 1 

\x-a y~b\ 

2 1 

-L ... 

fa?-a y-b] 

|8‘ 

j(i+>y 

\ ff l j <*% i 

2(1-/;) 

[*1 J 



it is evident that u « —+~, w * — - —• 

are independent normal variables with variances 2(1+/;), 2(1 -p) respectively. 


Thus, if 




4 . $» v ~ x j. _ Pj, 
°i + < a ~°i o-*’ 


(a - 1,2,. 


the n pairs of numbers constitute a sample from a normal bi-variate 

* [This paper and that which follows by W. A. Morgan were received for publication at about 
the name data. The subject of both was suggested by D. J. Finney’s paper, but the two treat 
manta and applications of the improved solution of bis problem, when p is unknown, follow quite 
different lines, Ed.] 
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population with zero correlation. Hence tin; dwtribution of their mmdalum 


coefficient, 
is known. 


M * tiiK-umrFm 


R 


if.- 

I ^ <h \ Si 


.V, .v 

ff * 


.ft.-* r( x > 

TV s,.) -n 


<h 


*/ $ 


it: 


if, 

. «**. 

ff? a! 




_.i 4. 

,<rV<rl 


«T<r 4 I \(t 


.s' 

♦’a 


cri 


( V r * 


*S‘x 


a\. a 


\(T, 




■ 4r® - 






Putting 
we have 


w « <r\l<r\, w * AYS’,. 


1? 


w~w 


>/{(«» +ft/)* ~ 4 r*K , w}‘ 

The values of w and r arc*, given by the sample, hence, from the known dt« 
tribution of R, fiducial limits for to can be determined. If we merely wwh to tout 
whether the values of A\ and A' 4 are significantly different, i.e. if we win!) to tost 
the null hypothesis w = 1, we simply insert this value of w and test for «iigniiir*u w 
the corresponding value of the correlation coefficient R. In determining fidm mi 
limits for 0 ) the arithmetic is a little easier if we make use of “Student V' s*4i«» 
tribution, or Fisher’s 1-distribution. We know that R % has a R( Jn - I) din- 
tribution, and that 

R_ _ w~u) 

-JP)“^4(l-r«)ww} 

is distributed like "Student’s” z for a sample of »-1, while 

a/(1 ~R*) V{4(I-f*)ufw} 

is distributed like Fisher’s t with n- 2 degrees of freedom. 

If a is any given number less than 1, wo can determine k such that 

P{j 1 1 <; k} » a. 

The inequality j 1 1 ^ k is equivalent to 
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where 


K 


1 + 


2( 1 - r*} k* 

" n-2 ; 


heitre .1))£«Sir(£+ ,/(£*-1))} - a. 

In the example given hy Finney (IH3H, p, 15)2), 


n 


w 


»S\ (5*2(li)) 8 

.S’* “ (4-?M)*’ 


f - 0*878, 


Substituting these values in the expression for l, and putting w = 1, we obtain 
f - 2 * 18 ) 2 , As n is large the distribution of t is approximately normal with zero 
mean and unit standard deviation, and therefore the value of/is significant at the 
1% level. If inav be netted that in this particular case the exact test gives less 
figiiifieiinec* than Finney's. 

For comparison with the case of uncorrelated normal variables we may note 
that if /t * {), w 

w+u 


has a //{!(« - 1), |{« - 1)} distribution, and therefore 

(to- w) s 
(to-I- w) 8 

has a K{\, \{n » 1)} distribution. As shown above, for any value of p, 

R % m . . 

(w + oj) s ~ 4r 3 wto 

has a 1% 1(« - 2)) distribution. 

The same method gives very easily* the distribution required for the estima¬ 
tion of p when the standard deviations of x and y are the same, oy = ay = a*. 


hence 


u = x+y, v = x-y 

are then independent normal variables with variances 2cr l (l+p), 2a 2 (l ~p); 

E{u t -u? E(v t -W 

UV +>)’ 2cr*(l -/>') 

are independent chance variables each distributed like y a with n- 1 degrees of 
freedom, and so the distribution of their quotient 

l—o X’(u 8 -~w) 9 
l+p A’K-ii) 8 

is known; ia/(l+w) has a -1), |(n — I)} distribution. The expression for 

Wk 1-pl+r 

l+p 1— r’ 

2£(x a -x) {y 3 -y) 


w ' 


where 


Z{x g -xY + £(y s -yY 


* CL Do Lury (1038), where the same problem is dealt with by a different method. 
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Ifx and y have the same mean value « as well m f !»■ *»».«• M^»d.ird *\ 
the mean value of r w 0, and therefore 

av; 

lV(l p) 

is distributed like ,\' s with n decrees *»f irmhini. Thus the 4w><nl*u 
quotient, j f , ■ nf 


t -• p 


w 

* ’M 


is known; wj(l + «•) has a U[\{n ~ 1}, |w| dbdrihutiots The e%|«rr*n->n u »■< 

1 - p 1 f f 
1 tff * - r 

22 ‘f,r,.£! f!/, £} 

£(*,-£)* tl r 

i -■ i lh) in. 

The procedure in this ease corresponds exactly to the treatment »d f.wlrjn 
correlation by the analysis of variance. 


where 


r *i 


UKKKHHNrKH 

Finney, I). ,T. (1938). Womtrikn,30 t Hw 2. 

Da Lury, D. H. (193H). /Inn, ittUh. fiinM. % 2, W iil. 



A TKST FOR THE NKiNTKK'ANCE OF THK DIFFERENCE 
BETWEEN THK TWO VAR IA NOES IN A ,SAMPLE FROM 
A NORMAL BIVARIATE POPULATION 

Hv W. A. MOHIJAN 
1 rtf iStotiriin, I’nirmily I'lillegr, l/nultm 

I, DkRIVATIOX OF MKRUHtlOn RATIO TKST 


I s;»published in a recent issue of this Journal D. J. Finney (1038) con- 
ludr-wi the follow rng quwtiorm, A sample of n pairs of variables {x, y) has been 
drawn from the bivariate normal distribution whose probability law is 


, k 1 

ir ,, , 4 pxp 

^/7*r } ' r 2 s { I -pij) 


1 1 


- -“dla)! 

1 


’-Pit" ,, + 

17 1 rr s \ ,r : 


•ID 


(i) What is the probability law of tho ratio,«, vv 


to 


“ *il»t 


.(2) 


ami UR* - V(x, -x)*;(,i~ t), «J 2 = £(j r -?)*/(»- D ? .(T 

i t 

(ii) Could this ratio be used as a criterion t o test the hypothesis that tr x * rr s 1 

Using a more direct method he was first able to confirm a previous result 
of Rose {11)35), giving the probability distribution of to in the case where tr x - <r 2 . 
and hern* to show that the chance that to exceeds a given value, say Q, could be 
obtained from the Tables of the. Incomplete Beta Function (1934), using the 

*>DW.( 5 T.W).W 

whM . (e) 

Since the probability expression (4) is dependent on the population corre¬ 
lation /q s , which will be in general unknown, Finney pointed out that the ratio to 
was not altogether a satisfactory criterion to use in testing the hypothesis tr l « p t , 
but he put forward a possible method of getting round this difficulty. 

The question arises as to whether there is not some other more suitable 
criterion for testing whether the variances are equal, whose sampling distri¬ 
bution will be independent of p n and of any other parameters whose values are 
nob specified by the hypothesis itself. The likelihood ratio method of approach 
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of Neyman and Pearson may he followed, thin mrttawl hm V x**M <4 

in a number of instances where the appropriate rntenmi *4 Mnwtately 

obvious. Summarized briefly, it involves the following #t*l* 

(a) A specification of the set of admissible hyputbrn* Jo the pjwwnt c*#e 
these will be defined by the joint probability law of the » J**n* of «»W|\ 


p(x v ?/i.!/«| si> ba> 0*1. ff a> /’ml 

t Jt | i'jT —• £, | 1^' 

...,,ytl tr' 1 ' * fn 

*\JPn r «V*4f*r 
\ or% } <r\ <r x ir t " <rjlj' 


* t M it i j | 
, . ffif 


where - oo ^£ v £,£ ao, (t*4 cr, ,<r $ $ •«. •• f '.pj| % * 

In this expression 2 and y are the sample mean*. r n the aaiuph* a-orirktem 
coefficient, and 

«?« T,{xt -£)*/»< 4 » ££v« -■ y P.». <»f 

i i 

(6) A determination of those values of the live unknown iwraturter*. m 
functions of the observations, which jointly maximize the rxprrwmm The 
solution is known to be obtained when 


£i * 2, it m V> a i * «* <“ *|. pit f ssr t*? 

The maximum value of ((!) is thus 

Pi[m&x) * - r^j , fS| 


(c) A specification of the hypothesis tested. This hyjmthwM that the 

probability law is of the form 

jP(*i* Vv I ip U rr , pub 
where the function is obtained by putting <r x w cr i m a in {Cl). 

(d) A determination of the values of the four unknown parameters (•„, f t „ «*, 
and p u which maximize this expression. These values may 1* ahovrn u, t« 

= it ~ y> <r - v{j{«|-t «f}i, p n m 2r J9 «,%;{s|»-«|). ..|iu| 


The maximum value of the probability function defined in (r| Own 

p a (max) * {errVt(«!+*|)* - . 111 ! 

(e) The likelihood ratio criterion is then 


Pr(tnax) “ 




_ {. («!-**)* 11- 
l 

{/)• The hypothesis tested becomes less and less likely as A moves from I to 0. 
To complete the test it is necessary to know the sampling distribution of A, or 
of a single valued function of A, when the hypothesis tested is true. 

It will be seen at once that the test differs from Finney’s because tfa» criterion, 
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unlike Itm m, in a fund km of r J# a* well tw *j and n\. The meaning of the criterion 
which ha® hwn picked nut by the A-method! becomes clear if we make the fol¬ 


lowing transformation of the original variables: 

Write x « A’ + Y, y«A f -F, .(13) 

m that X « |(x + y), Y « |(.r-i/). .(14) 


Then the population variances of x and ij may be expressed as functions of the 
vamni'i* and correlation for X and Y, as follows: 


.(1C) 


ff® * rr\ + a\ + $PxY cr x (r r< 

<r\ * <r\ + erf, ~ 2 p x Y cr x (r v . 

The rawssary and sufficient condition that the hypothesis tested is true, or that 
<r x -» «Tj, is that p xy - p. .( 16 ) 

Hi lire A’ and Y are normally correlated variables, the appropriate criterion to 
teat the hypothesis, p xr * 0, is the sample correlation coefficient between the 
transformed variables, i.e, r XY . If the hypothesis is true, this coefficient has the 
well-known probability law 

p( r xr ’Pxv 33 0) » constant x (1 - rf-j.) l(ri 41 . 

Making use of the transformations (14), it is found that 


.(17) 


4- 


.(18) 


,...(19) 


Hence the likelihood criterion of (126) is seen to be 

and as the hypothesis tested becomes less and less likely, A-+0 or r$ Y ~> 1. The 
test may therefore be carried out by (a) referring the r xr of (18) to the probability 
distribution (17), or (6) alternatively referring 

r xy J(n~ 2) 


t-- 


.( 20 ) 


V(1 ~ r xy) 

9 to “Student’s” distribution with degrees of freedom f~n- 2, and (c) rejecting 
Kit hypothesis when \ r xr J or ] 1 1 fall beyond the desired probability level. The 
test, it will be seen, is independent of the unknown correlation p n between 
x and y. 

2. The power or tub pest 


While the probability distribution of r XY is independent of p n if the hypo¬ 
thesis (<r* -- cr a ) is true, the chance that in using the rejection rule of the test 
we shall detect real differences between oq and (r s will depend on the value of 
p n . Suppose that we fix a probability level r(a) such that 


a = 


Ka) 


c(i-r s )K' l - 1) dr > * 


.(21) 


where a, for example, equals 0-05. 

* c is the constant of the probability law (17). 
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Then the chance of rejecting the byp<*f!»-«iK that 

7 <>\ "j Y 

when y ~~ y, + l,'« given by t he expnwiut 

t‘{r\y> r-{x) \y ■/,! W >r*f*l F - ih * ”1 

*r: tl 

1 j |t!f-j*3| 

J n i >i 

This expression Keyinnn & Pearson fUWM have u-rmrd «hr l««ct of the te»t 
of the hypothesis that y -a <iy<r, * 1 with regard to w .il*«TO»*nre t.*H)re*re 
y «y 1( hv the expression (2M>, p(r fo> pj Aw* »Hr general p»>.h»bih«,v 
for r in samples from a hi variate norma! population. lirst *«hte»i**d H it \ 
Fisher (11)15). It will he seen that a relation siitaiar ?«* »!** h*-M« Mworn 
p xy , y =s fr,/<r a and /> Ia , namely 

.. _ TrX!. ... /«*•{ 

AV " {(y - y ‘l*-» -Hi 


Owing to the symmetry of the distribution of r when /* • «*. the pnrm of tire *#»!( 
will he the same for alternatives p XY ami - p v} , it will therefore W the Mine !«*r 
alternatives y and y h p, n * example, the test i* rts likely It* reject lire In jwdhrsu 
(<r t m cr a )when rr, » 2cr s HSwhen n l » )<r 3 . This is clearly wh»! w*» t»h»»uld •a|«'d, 
as ctj and o-j are in no way differentiated, 

In Fig. 11 have shown the power fundiim of tire fad, taking a «•d In* and 
sample size n m 25, for alternatives y > 1, in the three mtm p u ** *», pu ~ « A, 
andp u = 0*8. It will he noticed that the test is more jm«-«riul whrn /<„ re let#** 
This of course follows from (24), sine® for a given value of y, p fJ * will be further 
from zero the nearer j p n | is to unity. 

The computations were made with the help of F. X. David ’» recent i v publish**! 
Tables (1938), The work was simplified by taking p xr at wmvertkuil vaim** 
0T, 0'2,.... 0-9, and finding the corresponding values of y by mean# «*f (21) 

The table on p. 18 shows, in the columns headed Test fa), the vahmutf ilw 
power function computed in this way for this case of n * 23 and *l#u f«r a « if 
and w = 100. 


3, COWfABJSOH WITH ETnNOT’s TK8T t» TO* CASK WHfcRK f> & 1# KSttKM 

In the case where p n is not known, Finney has suggests! thathia tost criterion 
o might be used by making a double appeal to significance levels on the 1mm 
proposed in another case by Hirschfeld ( 1 937), It dm* nut, however, appear wwy 
to determine numerically the power of the resulting tost. In the «m» wh«» $t m 
is known, it may be shown that the likelihood ratio criterion now heomnsa 


i + VaU-'u/J + , 

* This means that the hypothesis is to he rejected when r X r falls Sx 
bher tail of the distribution (17), or for the men = 25, when j t xr j > p. 


-- - ' v *VJpvrv*wM» » w mi ICJCUWJU WUW r 

either tail of the distribution (17), or for the me n = 25, when ! 


beyond Use S% tew! is 
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Tkm it ip 4 mWy » fjinrfjon of Kinney'* criterion «/, since it depends again on 
the **mjdk f n Tlw form of this expression is interesting, as it shows 

il ft u tn kmm n and differs from rem. in view of the correlation which exists 
hot wean * t . »nd r >r * the sample values of all three are relevant in examining 
the hvj»f<}ioni* that *r, « t 3 1 have not xucceedod in determining the sampling 
dj#f nhurion of ihr A of (2hi. or of any single-valued function of A. 



In the cose of p n known, it is, however, possible to compare Finney’s tost 
(involving p lt ) with the likelihood teat appropriate in the case where p n is 
unknown, but still of course applicable when it is known. The power function for 
Finney "a test may be computed as follows. 

When using this tost the hypothesis that y *» 1 is rejected if io>Q or if 
w < Q l , where St i« a constant chosen by using relations (4) and (6) so that 


fm 


p(<o \ y « I)rfw a |a. 


.( 20 ) 


The power of the tost with regard to an alternative hypothesis, y + 1, is therefore 

rir> r«> 

given by P(y) = 1 - P{Q~ l < w < Q) = p{u) | y) do) + p(v j y)dw, 

Jo J a 


where p(&> |y) « constant x (1 -p**) 10 4 

fW 


y<o 


■4PU 


See K. Pearson (1013). 


.(27) 

......( 28 ) 


Blometrika xxxi 
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The above integrals may be transformed to give 

jtt Inti . |5' 1 * 1 t j'»n 

S{y) ~ "it ’ jt j'* ^*t( 2 ’ 2 ! 

where ** 1(1 + (y&-*y ~ X U l )\H¥& 7 1* * ^* **•*’'M < v >t, 

and ,r s - ill - (y HJ- yU »} v |(y Hi- ySt ¥ * «» ' 

so that values of the pmver function may 1 h* ealrulatrtl wing the T«N** «'/ ^ 


Comparison of jmeer [unelumxof (n| t.>k<Hh‘?4 ua >»« »^ r 
(h) Finney * leal (him! mi wl when p n w knmm 



**•*.»«- '»■ * '«"*■■*■•' 
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fartmipkt* thta Function. These values are compared in the table on p. 18 
above with theme for the likelihood ratio test. It will be seen that: 

(1 i When p n ® i.c. when the t wo variables are known to he independent,, 
the i»f hawed on w or the ratio of sample variances is the better. This is a result 
already known, but the table shows how small is the difference between 
the frets. 

121 When p tt is 0-5 or 0-8. however, the likelihood ratio test is somewhat more 
sensitive for the small departures in y from unity, and less sensitive for large 
departures than the w test. Practically, this means that when p n is known, the 
m tert jk slightly the better, since we are most interested in situations where the 
chance* of detection of a difference becomes large, say greater than 0*5 at any 
rate. I t ift jKMjsible that a test based on the criterion given in equation (25), if it 
could be obtained, would be more powerful than either of the other two teats 
when p 1S t w known. In practical cases, however, it will nearly always happen that 
p u is unknown, and in such cases the r xr of (18) appears the appropriate test 
criterion to use. 
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THE distribution ok ranok IN SAMH.ES KBt»M A 
NORMAL POPULATION, EXPRESSED IN TERMS iW AX 

independent estimate ok standard deviation 

By I). NEWMAN 

Department of Stntintim, Vnwmly t‘nIhp, hmfam 


1 . 


Stautinu from the contribution of Tippett tinz^i.« nmwdrtnhk amount nf 
computational work has hmt carried unt in mvul ymrn with the »4 

making possible the u«e of range, U. the distance Mwiwi the high*** «n»l !'*«■<»» 
observation, when dealing with samples from a normal pnpul»ti«n thw 1 wH' ** 
tables of the mean range expressed in terms of the j#i(u»hil»«*n standard demtron. 


or, for sample sixes n * 2 to 10<K) have been republished w TWmJW ,Swjwwm**m 
and Bimeirieiam , Part II, Table XXII {K, IVarsnii. H*3R I^tpt K S IW^oi 
(1932) gave a table containing the standard deviation of rang*', awl the 
approximate upper and lower 10, f>, 1 and 0-fi % probability level* P*f 
sizes n * 2 to 100, again in terms of tr tut unit. In doing Ibis he tt**4 *nnpir«ml 
Pearson-type curves with correct momenta, and checked Sits nwtiH# *jt*m*t xonn* 
experimental sampling distributions. 

If a number of small samples are available, it Saw been *ht»wu that a repel 
estimate of or may be obtained from the mean value of the range, whirl* i,» only 
slightly less accurate than the estimate obtained front tin? mm* of wpmre*. Again, 
owing to the high correlation between range and standard deviation in % sample 
of size 10 or less, it was pointed out by Pearson k Haines {that rum#** may 
be usefully substituted for standard deviation in control charts M*e«l i« study 
changes in the variation of quality in industry. In all three enure, bow wr, tin* 
basio sampling distribution used has been that of the ratio of rang** U» «t\ 

Not very long ago "Student” (the late Mr W. K, Uowwt) sttgtfretrd to Prof, 


E, S. Pearson that it might bo useful to know more about the sampling, 4wt rdmt ion 
oftheratio 9.10/4, 


where w is the range in a sample of n observations from a normal population with 
standard deviation <r, and T is an independent and unbiassed mtimaUt nf a* 
based on/ degrees of freedom, oh tained from a sum of squares in the umml manner. 
The type of problem which "Student" had in mind was on® in which, an it reww.il 
of an experiment, a number of 'treatment’ means, say $ lt Z v are available, 
and also an independent estimate, s 2 , of their sampling variant®. Th®n a rapid 
method^ of judging whether any treatment differences exist would in 
comparing the difference between the highest and lowest treatment means, my 
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«’ * f n - jfj, with *. Should thin difference be clearly significant, having regard 
to the values of n and/, the more divergent of the extremes, say x v could be set 
wide, and the differencex„ - Jr, compared with h, using 1 and/. This procedure 
would i« fort be similar to that suggested by “Student" himself in his paper on 
“ Errors in routine analysis" (1927, pp. 161-2), except that the ratio used would 
now be «?/# rather than mpr. Of course* the probability levels for the former will 
tend to those of the latter ratio 

In the following sections I shall first make use of the results obtained by 
Prof, Pearson in computing probability levels for v'jtr, to determine appropriate 
levels for ti'X and then illustrate “.Student’s" suggestion on three practical 
examples. It should be noted that in a recent paper H, 0. Hartley (1938) has 
suggested a systematic method of obtaining probability levels for “studentized” 
functions. It is hoped that before long fuller and more accurate tables may be 
available to supplement the present tables which rest to some extent on an 
empirical basis. 


2. TltK KXI‘ROTATION OF ([ = wjs 

Before describing the method of quadrature by which the probability levels 
were obtained, it may be useful to give a table from which the expectation of q 

m 

for various values of n and/ can be calculated. Since a i ~ we have 

£*» l 

for the probability distribution of a, 

V(g\ * .- . f— - (I) 

If we write p(w) for the probability distribution of range, a function whose precise 
value is only known for the oases n = 2,3, and denote an expectation by the 
symbol A, we have 


fco 

%)» p(q)qdq 
Jo 

a 

m A p(w)p(s)dwds l since w and s are independent, 


(•<0 foo 

Jo Jo 


t* «j 

> wp(w)dwx 
Jo 

p«> 

<](w) x I 8" l p{s)di r. 
Jo . 


r x p{a)da 


= E(w) 

Since the values of A(u*/rr) for changing n have been tabled by Tippett, it is only 
necessary to consider the integral 


f u 


ers ” 1 p{8) da gitf-ai T[\f) cr 1 -- 
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Hence it follows that 

E(q) ~ E(ii'j<r) x yf I/) nil/-1}| H1/^■ ’ 

A brief table of the second function is given below; the value# of Sl«* may ta* 
obtained from Tippett (1925). pp. 3HB ", or from Tahk# f„r SMi*ho«n* ami 
Biometricians , Part II, Table XXII. 

TABLE I 

Factors by which to multiply E\u'«r) t«> obtain h\q) 

Degrees of freedom 3 .*5 7 l (t 20 3*' 

Factor 1-383 MX9 H2U 1-084 H*4» M'2« l •'*«* 


3. Computation of table of 5 and 


1% SKtJitFU’ANFK M'.VI'.I-H |'t»K 


u r .* 


The problem is to determine, for different values of/»mid/, vhUm-**/, t-urh that 



p(q)dq 



"‘•■'it 

p(«)d#}du\ 


o 


■; 3 | 


where a - 0-05 and 0*01. Since the distribution of </ will dearly k* i»idej»pu'lent 
of the population standard deviation rr, we may take a as unity in the probability 
functions used in (3). Except in the cases n • 2,3, which will In* referred Ut again 
below, the procedure adopted was as follows: 

(a.) The ordinates of the empirical curves, my j/(u;|, obtained by Vi. 5*. IVar#* <n 
(1932) for the cases n ~ 4,0,10, and 20 were used in phut- of the unknown p\n). 
These ordinates had been calculated at equal intervals of 0* I for«?, tin* imputation 
standard deviation being the unit. 

(i6) Taking a trial value of (/*, the integrals J(w, qj « | p(s) da were 

Ja 

lated, with the help of the Table# of the Incomplete Gamma Function (K, iVstwin, 
1922), for each value of w used in (a). Thus J(w t q u ) • l{u,p) in the notation of 
the tables, where 

T-lf-l (II 

(c) It was then necessary to apply quadrature to the product# #{«*) < 
calculated at intervals 0-5 for w, through an much of the range w » O to «• ?.« 
as was necessary to obtain the required degree of accuracy. 

(d) The resulting expression would of course not correspond to a ex telly, 
as q a had been guessed. Other trial values were taken for q xi and tin- final value 
corresponding to a = 0-05 or 0-01 obtained by backward interpolation. Starting 
with the case n = 2, where the exaot value of q a could be obtained for all /, and 
knowing for n> 2 the limits to which q a tended as/-* oq, this proems of trial and 
error was not found too laborious. 
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Sprr inl cam when n ~ 2 

in this ciw', taking tr « 1, the distribution of w assumes the simple form 


p(ir) » | e for 0 w < m. 

\J7T 

Hew* the joint probability distribution of w and s is 

* 2W %i /’{J/) S/ lfi ! " ! i/ * 

Transforming to variables </ » m/a and a, since 

|2(m,a)J 


(5) 


(0) 


we obtain 


(7) 

(«) 


%<?) 

p(y,s)« constant x aVitfp+in 

Now integrating for a between the limits 0 and co, we obtain for the probability 
distribution uf q 


Pin ) a 


A 


i'{W+v)i l+ ?!\- mn 


for t/M). 


(9) 


i fn! /’(*/) \ W 

This corresponds to the positive half of a “Student" distribution having/degrees 

of freedom. Values of q a satisfying the relation a = J>[q)dq may therefore he 

J <h 

obtained from R, A. Fisher’s (1938) tables of the percentage points for t. Thus 

(10) 

where t x will be respectively the 5 and 1 % levels for t* 


Special case when n - 3 

For this ease McKay & Pearson (1933) have given an expression for the true 
distribution of w. Tire quadrature method employed when n > 3 was again used, 
but the true values of p(u>) were taken, and not the ordinates of the empirical 
curve upon which E. 8. Pearson (1932) based his original tables of percentage 
limits for w, 

The following table shows the framework of values for q 0 . M and q m obtained 
as lias been described. Values for /» oo were of course already known, and 
values for n = 2 and 3 are exact. 

From Table II, the more complete working Tables III and IV were obtained 
by interpolation. It was found that the changes in percentage levels, both with 
increasing n and /, ran most smoothly if the arguments 60/w and 00// were used 
in place of n and /. On this basis, interpolation from the framework values was 
effected, using live and six-point Lagrangian formulae. Various checks were 

* These levels correspond to deviations at which the ordinates out off 2-5 and 0*5% from each 
end of the (-distribution, but they are termed by Fisher the 6 and 1 % levels. 
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TABLE II 

Framework values for q m and q m 


\ n 



5% points 





1 {X .L 

1 m 

jstinln 



/ \ 

2 

3 

4 

(1 

H) 

20 

2 

3 

4 

B 

Hi 

m 

5 

3-64 

4-60 

5-22 

003 

7-00 

8-21 

5-70 

m 

7-83 

«-W 

in sit 

U 85 

10 

3-15 

3-88 

4-34 

4-92 

5-60 

e-47 

4-4H 

5-27 

5-77 

042 

7-21 

8-22 

20 

2-95 

3-58 

3-97 

4-45 

5-U1 

5-71 

402 

403 

mu 

fi-JW 


MS 

30 

2-80 

3-49 

3-80 

4-31 

4-82 

5-47 

3-89 

4 45 

4-7H 

5-23 

57 ft 

8 40 

00 

2-77 

3-31 

3-tin 

4-04 

4-48 

5-01 

3-04 

4-12 

4 38 

474 

feta 

ftU 


carried out, as for example that of comparing the values obtained by this method 
with the known true values in the case of n « 2, Finally, the figure* were reduced 
to one place of decimals, and these are given in Tables Ill and IV. 


4, Illustrative examples 

In the following examples the range test is used a« an alternative to the c-teat. 
the latter is, on theoretical grounds, the more efficient of the two in the mtm 
that it is the more likely to detect the presence of real differences if they exist. 
Both “Student” and L, II. C. Tippett have, however, held that situation* may 
be met, particularly when dealing with industrial problems, where the gain in 
speed following the use of range justifies the relatively small loss in efficiency. 
No doubt other types of examples besides those illustrated Wow will occur to 
the reader. 

Example. A 

Fisher (1937, p, 93) has given the results of a 0 x 0 Latin square experiment in 
which six.different fertilizer treatments were applied to a crop of potatoes, 

Denoting these treatments by the letters A,B . F, the mean yields per plot 

in lb. were as follows: 

A B C D E f' 

345 0 420-5 477-8 405-2 520-2 ISO]-8 

The analysis of variance table is as follows: 


Sources ot 
variation 

Degrees of 
freedom. 

Mean 

squares 

Rows 

Columns 

Treatments 

Error 

5 

5 

5 

20 

10,839-72 

4,893-45 

49,835-98 

1,527-05 






TABLE IV 


I % point# for q - wls 
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To teat the significance of treatment difference* tut a whole »f hnd •- 4M * < 

while for degrees of freedom/ 1 « r,./ 8 • 20. R.A.Fiaher',Mahl^-»v,.^ M 
There are clearly, therefore, significant treatment: ihform™ « iir 

dividual treatment mentis have been plotter! in tin* m ■ ..mium m,' figure, and 
the tabled significance levels fur q »i\* may be used, with dmwlmi, m a foot- 
rule in investigating the situation. 




Table IV above gives for n = 0,/ = 20 a 1 % level for q of 5-r>, confirming the very 
significant scatter of treatment means brought out by the z-tesrt. 

We may now ask whether, if we were to exclude the most divergent treatment 
F, there is evidence of a significant difference amongthe remaining five treatments. 
We find that w = 520'2-345'Q = 175-2, and, using the same estimate of standard 
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rrr«r, n \t\ Wu find that. q « i l* ft Thin value is still well beyond q Ml 8*3, the 
figure ohfumpd from Table IV u it h n ... 5,/ •« go. 

Making Mina'iwive triala w« that: 

lit fhmtfmg *4 and F, tin* range nf the. four treatment* B. ('. I), and E is 
stiJf significant, shire q ... 113*0 Ifl-itfi -» 7-2, while for n .* 4, /« 20 we have 

f ?<H» I ® AO. 

liil Omitting A , E, and F, the range of the three treatinenta #, 6\ and D 
is mgmfimnt, at the 5 % hut just not significant, at the 1 % level. For in this ease 
i * "2*<i 13*05 at 4*3, while for n a* 3,/ k 20 we find r/ M5 ■■■■■■ 3*0 and q a<0l * 4*8. 

I*ii) On the other hand, if we divided the six treatments into two groups, one 
rmwisting of ,4, D. B. and the other of (\ E, F\ we find that the value of q in 
both groups falls beyond q mx 4*8. 

We are therefore led to conclude that the high value of z obtained from the 
comprehensive teat cannot he explained by one, or even two, treatments differing 
from the others. It is doubtful, even, whether any three treatments out of the 
dx could he regarded a» f orming n homogeneous group. 

Two final points should he noted. In the first place, after omitting successive 
treatments regarded aa divergent, analysis of variance procedure could be applied 
to teat for significant differences among the remaining treatments. The calculation 
would, however, not be aa quick as that involved in the successive trials (i), (it), 
and (iii) above, using q, Finally, as mentioned above, the method followed, 
whether z or q is used, must be employed with discretion, as is always the case 
when otwervatlnm are rejected and a hypothesis tested using the selected data 
that remain. 

Example B 

A similar example has been taken from Snedecor (1937, p. 214). The following 
are seven t reatment means expressed in terms of bushels per acre, obtained from 
a 7x7 Latin square experiment, again comparing the effect of different fer¬ 
tilizers on potatoes. 

A H a n E F a 

341*86 363*14 380*57 360*43 379*88 388*29 387*14 

The appropriate estimate of the standard deviation of a mean of seven plots 
calculated from the error sum of squares of the analysis of variance table is 
s = 11*52, Testing the significance of treatment differences as a whole, it is found 
that z ~ 0*5574, while for degrees of freedom / t = (i, / a = 30, 2 0 . 05 = 0*4420 and 
z im ™ 0*0226. The ratio q ~ «»/a for all seven treatments is 45*3/9*52 « 4*7 a 
value lying between g 0 . 05 = 4*5 and </ Q , 01 = 6*4 (entering Tables III and IV with 
b = 7,/ = 30). Thus using either test we should conclude that there were prob¬ 
ably significant treatment differences. 

The seven treatment means have been plotted in the lower half of the figure. 
There is a suggestion that either if (i) treatments F and O or (ii) treatment A 
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were regarded as exceptional, the remaining treatments would fmin a \wmu- 
geneous group. 

This is confirmed on investigation: 

(i) Removing F and 0, we find t/ » 38-0;lt*fl2 « *Hh which is Iwhw the 
5 % level (q m = 4-1) for a - 8 , / « 3. 

(ii) Removing A. m find 7 -- *•■« 2 * 8 , which is well Mmv the b"„ 

level (g 0 , 05 * 4 - 3 ) for n ~ 6 ,/« ». The mure homogeneous gnmp apj#*«re U» k* 
left in the second case, i.e. on removing ,4 alone. Beyond this indication cannot 
go, as it would need a knowledge of the character ol the treatments t« draw more 
definite conclusions. 

F.rnufjilf (' 

In cases where a number of duplicate observations are available, an Mromto 
of variability may be obtained rapidly from summing the spumes *4 the differ 
ences between pairs. Thus * 

1 1 

and lias k degrees of freedom. We may now compare the range in the memo* ol 
pairs with s, in order to determine whether there is loo much variation k-twreu 
pairs having regard to the variation within pairs. It may be noted that an 
estimate of <r may lie obtained even more rapidly by calculating: the mean range 
in pairs and multiplying by O-HHfl'i,* ho that 

k 

s' - 0*8882 x X j ;r n - x a ) ik, 
t-1 

but since the sampling distribution of {s'f ia not that ofy*. we cannot justifiably 
take q = wjs 1 and refer to the tables of percentage limits I have given. 


Determinations of percentage fiirrr 


Analyst 




... ■*-*■■« 



“—T- "if 

. \ 

A 

11 

V 

D 

tf 1 

F 

‘i ! 

i 

H ; 

1 1 

1st determination (%) 

12*11(1 

12*51 

12*32 

13*15 

12*73 

12*4h 

12 30 1 

12-14 

ij 

ri-4» | 

2nd determination (a:, a ) 

12*47 

12*82 

■ 

12*55 

12*03 

12*43 

12-4(1 

(2 7,1 

nm 

■ 

13*44} 

Difference (d t ) 

o*to 

-0*11 

-0*23 

0*22 

0*3(1 

— 

0*02 

-0-43 

(HI 

urn} 

Sum 

25*13 

25*13 

24*87 

26*08 

25*16 

I ■%£ 

I 

25*03 

WWWMJu.iro.w 

24-17 

ISMffij 

* The reciprocal of 
1T28380*. * 

1 *12838, since the expectation 

of ran 

ge in a 

sample 

of two 

intlivic 
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The «kf» *!»mw» mii p 2H above consist of ten duplicate determinations of the 
prmdagr fibre m nsrefnlly mixed »m»j >1ph taken from flic 1 same supply of Soya 
r«H*m fake, mrh pair of valium Wing obtained by one analyst.* Ton different 
Aitalyufa were concerned. The* problem! is to determine whether those few obsem* 
tprovide* any evidenw of systematic differences in tcehn»}U6 between the 
a nalytef# 

The full analysis of variator i« «w follows: 



' 

Kum (if squares 

n.r. 

Mean square 

Within (wore 

<M 11450 

m 

0-041145 

JMwrni p*n« 

1-352045 

« 

0-150227 


1-7 034115 

m 



. 

. .... 

. . . . 


Testing for differences between analysts, we obtain 2 » 0*0475, a value falling 
between the 1 % (0*7)M)0) and 5 % (11-5527) levels. 

Using the range method, we note that the estimate of the variance of the 
turn of two determinations is 

1 k 1 lo 

*f«J to/-1 

giving an estimate of the standard error of a sum of 0-2Hf!K,t T’ho range in the 
ton sums in the table is w « 20-0824-17 * Hit, so that q * l >01/0*2808 ® 0-06! 
For n » 10, / « 10, Tables III and IV show q wl m 5*0, q Wi « 7-2. Thus the 
difference ia significant at the 5 % level, a result similar to that found using the 
z-teat. 

If now we omit determination!! of analyst I) (who gave the highest readings) 
we find}* (2S*9K-24-]7)/O-280H = 0-31. and is still significant at the 5 % level, 
since. for n * 9, / » 10, g M# ** 5>S. If, however, we omit, the determinations of 
analyst H (who gave the lowest readings) we find q * (26-OR - 24-K7)/0-2K(iS = 4-22, 
and is no longer significant. We should conclude, therefore, that except possibly 
in the ease of analyst H, there is no evidence on the data available of systematic 
differences in technique, 

I should like to take this opportunity of thanking Prof, E. S, Pearson not only 
for suggesting the problem to me, but also for bis kind help in putting this paper 
together. 

* The figures have been taken from more extensive data made available through the kindness 
of Dr J. F. Tocher. 

f The corresponding estimate obtained, as described on p. 28 above, from the mean difference 
between pairs is 0-291. 
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Range in sample* from a normal population 
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ON A COMPREHENSIVE TEST FOR THE HOMO- 
GENEITY OF VARIANCES AND COVARIANCES 
IN MULTIVARIATE PROBLEMS 

By I). J. BISHOP 

Department of Mali,vim, University College, London 
I. Introduction 

Now that satisfactory and probably final solutions have been obtained for a 
wide variety of statistical problems concerned with a single normally distributed 
variable, more and more attention has recently been given to the solution of 
multivariate problems, '['lit', multiple correlation methods of the old large sample 
theory have been replaced in many instances by others for which “studentized” 
test criteria are available, often having sampling distributions that arc already 
familiar in univariate problems. In a recent paper on “The statistical utilization 
of multiple measurements", R. A. Fisher (1938a) has shown the connexion 
between certain of these methods: the /^-statistic work of Mahalanobis, the 
discriminant function methods of the Gallon Laboratory and the generalized 
"Student 's" ratio of Hotelling. A similar very general problem was dealt with 
some time ago by S. S. Wilks (1932), while mention may also he made of two 
papers by I). G. Lawley (1938a, h) and a paper by P. L. Hsu (1938). The purpose 
of the methods put forward is to obtain information regarding the mean values 
of a number, say q, of correlated variables in one or more, say k, populations 
from which random samples have been drawn. If we denote by x s a value of the 
sth variable (5=1,2,,.., q), then in all this work it has been assumed not only 
that x„ is normally distributed, but that it has the same variance erf in every 
population sampled. Further, it is assumed that if x u is a second variable the 
correlation coefficients p m between x, and x u is the same in all populations. The 
estimates of variance and covariance required in order to "studentize” the 
function of the sample means are therefore obtained by pooling together the sums 
of squares and sums of products from all samples. While it is true that even if 
<r, and p m( are not the same in all populations the error involved may not be very 
large, it is however important to have available some means of testing the basic 
hypothesis which assumes homogeneity throughout the populations. 

Such a test has been derived by S. S. Wilks (1932) by an extension of Neyraan 
& Pearson’s likelihood ratio method of approach. Hitherto the somewhat lengthy 
computations required to obtain the moments of the sampling distribution of 
the test criterion have probably discouraged its use. The objects of the present 
paper are as follows: 

(a) In the simple but commonly met ease, where the k samples are of the same 
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s i Z e, to put Wilks’ test into such a form that once the lengthy ]>r.»,f im¬ 
puting the kq variances and \kq{q - 1} covariances has been carried out, relat ively 
little further labour is reqnired to obtain a test criterion which may he referred 
with practical accuracy either to Fisher's ^-tables or to the TnhU* nf th* Jnmm- 
plete Beta Function (K. Pearson, 1034). 

(b) In the ease where the sample sizes differ, to suggest an alternative pro¬ 
cedure which is accurate when dealing with large samples. 

It is of course always open to question how far a single comprehensive 
criterion is satisfactory in a complex problem. Certain point* should, however, 
be remembered. In the problem referred to above, dealing with the means only, 
the usefulness of a single criterion has been widely recognized, It. when applied 
to adequately large samples, a suitably chosen comprehensive mict ion shows no 
significant evidence of lack of homogeneity among the variances and mvamncea, 
we are saved the lengthy process of making many individual comparisons, If. 
however, the criterion falls beyond the significance level, it will l«* nmwiry, 
as when dealing with the means, to make a more detailed analysis in order to 
locate the source of disturbance. 

Finally, it may be noted that in the. ease of a single variate {*/ 111 he problem 

is much simplified, A full discussion of two tests available in tins raw, the 
Neyman & Pearson L x test and the Bartlett p test has recently been published 
elsewhere (Bishop & Nair, 103!)). 


2. Wilks’ generalized likelihood criterion 


It is proposed first to define this criterion in the simple east* when the samples 
from each of the k populations are of the same size, n; to quote the sampling 
moments derived by Wilks; to give the form of a working teat developed in the 
later pages; and to illustrate its use on an example, 

It will be convenient to use the following notation, We shall consider k 
samples, each of size n, drawn from populations of q variates, '{’hat is to my, the 
total number of individuals sampled is kn « N and for each individual? characters 
are measured, so that the total number of observations is ,Y</. l*«t ^ }, t . tf, r 
observed value of the character s of the tth individual in the tth sample. 

Then 5=1,2,q; <=1,2. k\ and» - 1,2. n. 


Let 


1 n 

■■ - X 

U {_1 


be the mean value observed for the character s in the tth 

1 » 

v ™i ~ n ( x sii ~ «k) ~ x m,) * 

* The use of the divisor n instead of the more usual w-1 
constant in the expression for A t . 


Now put 
.U) 

for a multiplying 
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where*. s, u --1,2,..., q. Then if s = u, v ml is just the variance of observations of 
character ,t •- u within the <th sample, whereas if s=£n, v aM is the covariance of 
t he characters h and v. 

The generalized variance in the ith sample is defined by the symmetrical 
flotcrminant 


.( 2 ) 


Then if 


l ’lM> V lZ/< ■" 

> v kt 

v m> • • • 

> V 2qt 

k’l ql’ V 2ql' 

> Vqqt 

11 

f^-i *-* 

2 



.(3) 


the. likelihood (criterion appropriate for testing the comprehensive hypothesis, 
say 7/ n , that all the corresponding variances and covariances in tho k sampled 
populations are equal is 

II 'W in 

A '“'fcr ■ . <4) 


HSUlUdsMKtll 


HHl 






niiasiiiini™ 




power of A t , rather than that of itself, owing to the extreme skewness of the 
latter distribution. Within limits there is some choice in tho power of A* which 
may he selected to give a convenient sampling distribution. Throughout this 
paper we shall follow Pearson & Wilks (1933) by using the l/Nth power, so that 
the criterion for use in practice will be 

/ n 


A > - v - / 
A l “ <1 


.(5*) 


If // 0 , the hypothesis tested, be true, the /i th moment about zero of the sampling 
distribution of l x has been given by Wilks (1932, p. 490) in the form 


, h " l fW(n~ 1) + U-h-i)) ‘ 


. («) 


As a result of trial in a number of cases it seems probable that the Pearson Type I 
distribution in the form 

. (,) 

will give a good approximation to the distribution of l u if the parameters m x and 

♦ Note that (i v (</ = 1)1* = L v in ths original notation of Neyman & Pearson (1931). 
Biomctrlka xxxi 3 
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n h are so chosen that the first two moments of (7) agree with the true values as 
given by (6). If this is done it is found that 

.(X) 

. 


and 

where 




(1-/4) 

/h 


/<2 = /<i”(/'l) 2 ' 


.(l‘l 

......( 10 ) 


fi[ and /<' being respectively the first and second moments about zero of I t . From 
the relation (6) it is seen that their values are 


ll ~ ill »)))>< 


D41 ,s))i 


fill ] 


and 


/4 = & n 


a «1 


(i\i(n+yk-«))) k nm »-1 ) +1 -•*))! 


. (121 


The hypothesis //„ will be rejected when l t is exceptionally low, The prob¬ 
ability level may he obtained from the Tables of the Incmjibh Hrln Function. 
Alternatively 5% and 1 % levels for 4 can be obtained from R. A. Fishers 
tables (19386, Table VI), by writing 


where z has degrees of freedom 


h 


h 

7*+/!***' 




h = 2 %- /s — 2?W|,* 


.(H) 


It is no doubt the labour involved in calculating the moments /<( and /d that 
has discouraged the use of Wilks’ test. As will be shown in the later sections of 
the paper, if n is not too small,f the following empirical relations may lie used 
with sufficient accuracy for practical purposes to express the w, and m i of (14) 
directly in terms of (a) the number of variables <y, (It) the Humph size n and (r) the 
number of samples k\ 

m l = k(n~q)-im(k~\) (i)0-3l»f/4 % 3 ), .(If.) 

m 2 = 0-25(fc- 1 ) 5(5 + 1), ...... (Ill) 

These formulae are found to give satisfactory results for all values of k, over 
the range considered for q, namely l to 6. 

In the following sections of the paper we .shall first give an example illus¬ 
trating the practical use of the test and afterwards justify the approximations 
which have been made. 

* Since in general A and/ a in (141 will not be integral, it is necessary to interpolate in the 
tables for fractional degrees of freedom. 

t Certainly if ?i^20, and probably if a>10 provided J<6. 
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3. Illustrative example 

The data used were taken from the appendix of a craniological study by E, 
Pittard (1909) and consist of measurements of skulls of males found in different 
localities in the Rhone valley. The characters considered are: 


(1) Length of skull in mm., denoted by L. 

(2) Breadth of skull in mm., denoted by B. 

(3) Height of skull in mm., denoted by H. 

(4) Breadth of face in mm., denoted by F, 


The skulls are divided into five groups of thirty skulls each, according to the 
neighbourhood in which they were found. The five groups are: 


( 1 ) 

( 2 ) 

(3) 

( 4 ) 

(5) 


Skulls found in the village of Biel. 

„ „ „ „ Naters. 


>> »i >> >> 

n jj n >> 

n n )> >j 


Viege. 

Rarogne. 

Sierre. 


It is desired to investigate if the population variances and covariances of the 
measurements of the four characters may be taken as the same for each of the 
five populations sampled. This is equivalent to asking whether the population 
variances and covariances differ from village to village. 

With the notation previously employed it is clear that in this case n = 30, 
k = 5 and q = 4, The sample variances and covariances, as given by ( 1 ), were 
calculated in the usual manner. Their values, together with the sample corre¬ 
lation coefficients, are shown in Table Ia. Using these results, the values of the 
generalized sample variances | v mil | , which arc also given in Table Ia, were 
calculated from equation (2). The substitution of these values of into 
equation (5) gives l x — 0-725. 

The probability levels of the sampling distribution of l x are given by (13) 
and (14), where 

m l = 120-hko and m, - 20-000 

are determined by substituting k — 5, n — 30 and q » 4 in the empirical relations 
(15) and (1(5), The levels are found to be: 

6 % level of z - 0-1830 and 1 % level of z = 0-2573, 
hence 5 % level of l x = 0-815 and 1 % level of l x = 0-791. 


The calculated value of l x being considerably below the 1 % level, the hypo¬ 
thesis that the variances and covariances are the same for each of the five popu¬ 
lations sampled must be rejected. 

Further tests may now be made in order to find where the lack of uniformity 



36 A Test for the Homogeneity of Variances and forarmnns 


TABLE 1a 


s 

U 

»W 

I'suH 




■i 

1* a 

-1 

L 

L 

311-2222 

46-1333 

46-9123 

29«267 

34-OS«#> 

39 23-87 

B 

H 

2A-4700 

28-4989 

22-5789 

43-4622 

21-75.56 

28-3544 

II 

H 

20-6989 

18-3733 

23-9156 

41-1289 

32 4933 


F 

F 

32-6489 

25-8707 

31-4322 

25-4!Htn 

16 0722 

264240 

L 

II 

22-0333 

1-5000 

10-5212 

- 2 4467 

mom 

,8-4504 

L 

H 

1-7444 

H-1HKX) 

0-3689 

19-4467 

0-2600 

6 1410 

L 

F 

28-055(1 

7-2333 

12-5922 

0-3867 

--0*3278 

9 .'*880 : 

B 

H 

1-8433 

10-5867 

— 2-268(1 

16-0822 : 

- 7-2ooo 

3-80.h7 I 

B 

F 

20-7400 

12-4567 

7-3744 

17-5067 

3-2111 

12-2578 

II 

F 

8-9844 

1-0467 

— 104KHI4 

15.3300 

0-9000 

3-3721 [ 

l.< 

i| 



1 l '.„t | 

I l ‘in2 ! 

1 '',<a ! 

1 •'»»( 1 

■ ‘W ! 

i : 

i 



138,521 

307,934 

519,105 

422.517 

296,115 

t 

383,487 j 

s 

u 

r„a 


**Jw3 


; 

1 

{ 

L 

II 

(1-69702 

0-04137 

0-32327 

- 0-116706 

■ 

0-39081 

j 

> 

L 

II 

0-05111 

0-30570 

11-0110! 

0-5.1522 

0-00781 


L 

F 

0-78401 

0-20935 

0-32792 

001402 

0 01.375 

1 

B 

H 

0-08701 

0-48265 

.-.0-09764 

0-3H0.3K 

- 0-270-80 

S 

n 

F 

0-71913 

0-45871 

(1-27681 

0-52597 

1 O-16M01 

| 

H 

F 

0-28853 

0-07552 

-0-36475 

0-47346 

! 003867 

5 


occurs. To decide whether the variances differ we may apply four single. variate 
L 1 tests, treating the observations of each character separately. When this is 
done the results following in Table IB are obtained, 

The tables of P. P. N. Nayer (1930) give the 6 % level for L v whim n « 30 and 
k = 5, as 0-936, so that there is no evidence to suggest that the variance* of 
measurements of any one character differ significantly from sample to sample. 

The lack of uniformity must therefore occur in the correlations, m that it is 
necessary to examine the variation in each of the six sets of five nurelation 


TABLE 1b 


Character 

w 

L 

0-985 

B 

0-967 

H 

0-964 

F 

0-974 
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coefficients. If r be a sample correlation coefficient calculated from n pairs of 
observations it is known that 

= !(log e (l+r)-log e (l-r)) 

is approximately normally distributed with standard deviation l/J(n-3) } the 
mean of z' being a function of the population correlation coefficient and the 
sample size n. Consequently, we may test whether correlation coefficients 
r aul (t = 1 , 2 , k), each based on n pairs of observations and obtained from k 

independent samples, differ only through chance fluctuations from some 
common population value p mi , by calculating 

= 3 ) 2 (*}-*')*, 


where z' - 4 S 

hi -1 

and entering the tables of tlie y 2 integral with k~ 1 degrees of freedom. A 
significantly large value of y 2 will indicate that the k samples cannot be con¬ 
sidered as having been drawn from populations with a common value of p. This 
procedure was carried out with the results shown in Table I a. 


TABLE Ic 


Correlation between 
characters 

t 

Remarks 

L and B 

14*31 

Significant 

L „ H 

7*82 


Ij H i* 

20*47 

Significant 

H „ It 

11(17 

Probably significant 

n „ f 

H-K 


H 7’ 

12418 

Probably significant 


The probability levels for the distribution of y a with/ = 4 are: 6 % level » 9*40, 
2% level » 11-07 and 1 % level « 13-28. It is seen that all the six calculated 
values of y a are above the expectation value of 4. There is significant variation in 
the correlation coefficients r uu and r ltFt , whilst it is largely a matter of personal 
opinion as to whether the suggestion of lack of uniformity in r nm and r uv 
shall be judged significant or not. 

To summarize: analysis of the data leads to the following conclusions: 

(а) The comprehensive test shows evidence of significant variationfrom sample 
to sample of the variances and covariances. 

(б) This lack of uniformity is not due to differences in the population vari¬ 
ances. 
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(c) There is clear evidence of heterogeneity among sonic ni the correlation 
coefficients, in particular for those, between length and breadth ««{ skull and 
between length of skull and breadth of face. Results ut this kind are frequently 
met with when dealing with crnuiological data, the correlation eoellit ieuf olten 
being subject to a considerable degree ol instability, '1 he meaning to be attached 
to these fluctuations in the correlations, when considered from a craniological 
viewpoint, seems to be somewhat obscure. 

The above example may give the impression that a great deal t>i labour is 
required, even after the comprehensive test is used. If, however, the compre¬ 
hensive test had provided no evidence for rejecting the hypothesis //„. it would, 
of course, not have been necessary to make the four single-variate h x tests and 
the six correlation tests. Even when the hypothesis //„ is rejected and if is 
necessary to apply separate tests to find the causes of the lack of uniformity, the 
only labour which will have been wasted is the relatively small amount involved 
in the calculation of the determinants | c sl)( [, The really lengthy computation is 
that required to obtain the sums of squares and sums of products on which 
Table I a is based; this cannot he avoided if a detailed analysis is desired. 

4. The adequacy ok the Tyke I aitkoximation’ 

The work now falls into two stages, which will he concerned with 

(a) the adequacy of the Type 1 curve of equation (7) to represent the unknown 
true distribution of 

(b) the accuracy of the empirical formulae (15) and (Hi) for hi, and m r 

The hope that the Type 1 form of curve might give tut adequate approxi¬ 
mation was based on two main considerations; 

(а) Values of l t are restricted and can only lie between zero and unity. 

(б) On other occasions the use of the principle of likeliluxid ha« yielded 
criteria which are either exactly distributed in the Type I form, or an* »o dis¬ 
tributed that a good approximation has been obtainable by the use of this kind 
of curve, 

In order to compare the true distribution with the approximate form, the 
first four moments of both distributions have been calculated in a number of 
cases, The manner of choosing the approximate distribution ensures that its first 
two moments will agree with those of the true distribution, and an idea of the 
accuracy of the approximation may be gathered by comparing the third and 
fourth moments of the true distribution with the corresponding momenta of the 
approximate form. The distribution is such that a comparison of momenta 
calculated about the mean is easier than that obtained when moments taken 
about zero are used. The cases considered fall into two groups: 

(i) k = 5; n = 10, 20, 40 and 50; ^=1,2, 3, 4, 5 and (5 
and (ii) n = 30; k ~ 2, 6 and 10; q = 1, 2, 3, 4, o and 0, 
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The process of calculation was as follows: 

t 

(a) The first four moments about zero of the true distribution were obtained 
by using 10-figure tables of logarithms of the Gamma function (E, S. Pearson, 
1922). A large number of figures were required in the values of these moments, 
since the dispersion of many of the distributions was small. 

(b) The values of /t 2 , fi 3 and /q, the second, third and fourth moments about 
the mean, were then obtained in the usual way and the constants m l and m 2 
were calculated by using (8) and (9). 

(c) The third and fourth moments about zero of the Type I distribution (7) 
are given by 

, = ffliK + l)K + 2 ) K + 2) ^ 

1 3 (% -I- m 2 ) (wq + m 2 + 1) [m x + + 2) (?/q + m 2 + 2) ^ 2 ’ 

d / = _ Wi(»» 1 +1)(«H+2) (w , + 3) __ J% + 3). 

1/4 (wq-t-wq) [m l + vu+ 1) (wq + j/ijj+2) (wq + wq + 3) (wq + wq + 3)"' 3 

.(18) 

so that, using values of wq and m„ already obtained, these may be calculated. 

(d) .From the values of x fi’ 3 and t /4 the corresponding moments about the 
mean, t /i 3 and ^tq, were calculated, Comparing /q and /q with ,/q and t /q, 
respectively, gives an idea of the adequacy of the approximation, 

Table II gives the results obtained; in addition, values of (i x ~ /illA mid 
lh ~ fkll l t arc included as measures of the skewness and kurtosie of the distri¬ 
butions with which we are dealing. 

It is seen from these tables that the distributions represented vary widely 
in shape, but as far as can be judged by comparison of third and fourth 
moments, the use of the Type I approximation would seem to be quite satis¬ 
factory in every ease considered. The eases dealt with cover a fairly wide field and 
have not been chosen with a view to obtaining especially good agreement of the 
moments, 

The inclusion of curves for which q ~ 1 may be questioned having regard to 
the fact, already stated, that the single variate problem has been fully treated. 
We must therefore remark that, as L x {/,(</ 1 )}*, the fact that the distribution 

of L x may be represented by a Type 1 curve does not imply that this kind of 
approximation will work adequately for the distribution of l x - W' Moreover, 
some of the distributions obtained by putting q « 1 are abnormally skew and 
highly leptokurtio. As it happens, when q ~ 1 the Type I curve appears to give 
as good an agreement as for other values, hut the exclusion of this set of distri¬ 
butions might have had the effect of making the approximation appear better 
than was really the case, in that it would have removed a number of the “more 
difficult” distributions. Thus, although tiie criterion L v will probably always be 
used in preference to l x (q = 1) = LJ, it seemed desirable to include distributions 





TABLE II 

Moment coefficients of l t in trial rases 


k = no, of population. 1 ) wimpled, q = no. of variaMi-«, a - w of iwuijJi'n 


<1 


k — S 
n = 10 

k - 5 
« - 21) 

k - 6 | 

j 

k 5 
n .. -jtl 

1 

Pi 

Pi 

pi 

lPl 

Pi 

0-9551,5592 

o<KMisi,:my 

0-9783,4(125 ! 
(I-IXHI2,1(1(11 1 

(l-UKIll, (12021(1 
(I4HHXI,1)37617 

(MIH'.IV, H3!aw. 
(HXXX),fi20392 



- (HXHKI,0-129 , 

-O-lXXXI,(112438 

.1118**1,(8(51 KH 


-04KXKI,3(164 
O-IXKX),04(11 

~(|.(XMH),(M:10 j 
0-(HHX),(Kl2(i 

-(HXHX t,ll 12440 : 
(l-ODIHI,000514 

-IMKXX(.(X»31H4 

IH***|,r88U52 


(HHXX),O402 

O-(MMH),(Hl20 | 

()-(HHX),(HKI505 

IHHXXMKHlIM 


If 

HU 

1-Kl 

1-88 

till 


fit 

5-2 

5-li 

541 

5(1 

2 

lh' 

0-8042,3289 

(1-93(12,9104 ’ 

041583,83(41 

n-'.m!«i.!i!ir.Hi'5 



041025,9457 

(HXKIH,2(104 j 

(MXK12.7472 

11-1**11,531271' 



-04X100,8171 

-0-IH)lX),l 137 

—O-tXKXl,1)345 

..11 IBaKi.nl 11.71 



~04XHX>,8178 

-04HXHl.II.Ti 

-II4XXX),034?) 

.-lMmi,(f{407? 


ll 4 

04XXX),2312 

0-lXXX 1,0140 

ll-IKXx),1X129 

M-tHttM.IIMtHIOl 


l/^4 

04X4X1,2312 

O-(XXK),01411 

(HKXX1,0029 

IHKX*!,lX*XH2 


If 

0-38 

(1-53 

0-57 

(1-39 


Ih 

3-4 

3-7 

3-9 

38 

3 


0-7330,1(741 

0-8737,2072 

011172.6148 

uli.'Kl.li.VXi 


fh 

0-0042,3500 

04X111,4801 

()4XKtfi.ltl75 

111**12,9159 


fh 

— (HHKKI.8380 

-ll-IHKK), 17(17 

-01HHKI.05IHI 

™ 111***1,1)21)2 


\Pi 

-04KKHI.H4OI 

-O-(HKK),17119 

— II-(MKH),II5!H1 

-IMKHXI.II2I12 


Pi 

(1-0000,5301) 

(HHMKl.0427 

(HKKHI.(KlftU 

01**11,(1029 


0-01HX),5-0)3 

(HKHK),1)428 

IHHHHI.IHIIHl 

11-1*88),181211 


K 

0-09 

0-21 

11-25 

9-27 


% 

34) 

3-2 

3*3 

... .. . 

3-4 

4 

Pt 

0-5708,8(105 

0-7939,4237 

—.y 

0*8639,47H.1 

11-81184,0-142 


Pi 

0-0050,1034 

0-0010,0539 

0-(XM)7,93(ll 

IHKXH.WHil 


Pi 

-0-0000,21)07 

-0-0000,19(18 

-04HHX),1)774 

-IKXX81,031151 


i/h 

-0-(XKX),3099 

—O4HXX),1073 

—04XHX),0776 

~(I'|*hhi,(i3(10 


Pi 

iPi 

0-0000,7250 

0-0000,0850 

<1-0000,0198 

11-1***1,0007 


Q-0(KK),72(12 

0-0000,0850 

0-0000,0198 

04XKX),(XH17 


A 

0-01 

O-OH 

0-12 

9-14 


Pi 

2-0 

'3-1 

3-1 

3-2 

6 

Pi 

0-4123,5890 

0-7010,282(1 

0-81X12,0843 

0-8.VHI.52l7 


Pi 

0-0045,0478 

04X120,6001) 

0-0010,5521 

(KKxm.3313 


Pi 

+ 0-(KXX),320H 

-0-IXXXU587 

-O-IKXXI,0829 

- (WKKKI,(H37 


i Pi 

+ 0-0000,2BB2 

- o-oooo, mm 

"04XKK),0831 

--0-0000,0438 


Pi 

0-(KK)0,5003 

0-0000,1202 

O-(XKK),0,341 

0-(KKXI,0|23 


iPi 

0-0000,5897 

0-0000,1202 

O-(XXX),O330 

IHXXXJ,t)124 


A 

0-01 

0-03 

11-0(1 

04)8 


rl 

2-9 

3-0 

3-1 

3-1 

6 

Pi 

0-2003,9120 

0-5909,9719 

0-7282,3088 

0-7944,5984 


Pi 

0-0030,6059 

0-0022,3214 

0-0012,0522 

0-0007,9288 


Pi 

+ 0-0000,4976 

-0-0000,0779 

-0-0000,0720 

—0-0000.044IS 

■■ 

lPi 

+0-0000,4551 

-0-0000,0823 

-0-0000,0734 

iHiliTOiWTll 

■■ 

mm 

0-0000,283( 

0-0000,1472 

0-0000,0480 

04)000,0185 

BB 

IBS 

0-0000,280' 

0-0000,1472 

0-0000,0481 

0-0000,01(11 

BB 

I mm 

0-09 

0-01 

0-03 

0-04 

II 

III 

3-0 

3-0 

3-0 

2-9 


ir - .V) 


M-juith.amiayi 

t! i««kv**2ii3h . 
(Htonu.imNOi i 
m i* ** ipio)a Ml7 i 

If 


M'9«m.2»ll,VJ0 j 
iit.««i.977u34 | 
0 i»»«i,i»i7,Vo | 
M MHOMWi.V*? I 

ii unai.i««i:iHl | 
m u»io.i«»i;ja:»; 
(Mil "i 

.fit 1 


<i »c.t<i.;'.;j7;to 
II l*»ll,-.932M 
II IM«l.(i|3M2tt 
iHRiai.oKtHiM 1 

M'2H 

.11 


IIHIIHUVr? 

trnmwm 

iMHItKl.Uftia 

tHKnm.iiuai 
( 11.1 
3-2 


II HHOM.nm.', 
(M***|,2u33 
- d-MKW.ltiUUt 
~ (ihixkuTO 

tl-OOUO.OIttS 

It-UU 

3-1 


0-8347.89&B -■ 
(141006,3842 ! 
—(HKX)0,(t280 i) 

-o-oomumi ) 

(MKXK).00fl5 | 
<MWQO.!XiK» 5 
11-06 
3-3 
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1 


n = 30 

n = 30 

n = 30 




fc = 2 

fc = 5 

k= 10 


1 

lh' 

0-0913,444323 

0-9861,020246 

0-9844,364077 



Mi 

0-0001,459732 

0-0000,937517 

0-0000,527977 



Ai 

-0-0000,047980 

-0-0000,012438 

-0-0000,00351(1 



iMa 

-0-0000,047992 

— 0-0000,012440 

-0-0000,003516 



Ih 

0-0000,002947 

0-0000,000514 

0-0000,000138 



I /*4 

0-0000,002947 

0-0000,000508 

0-0000,000124 



A 

7-40 

1-88 

0-84 



A 

13-8 

5-fl 

4-9 


2 

lh' 

0-9737,237167 

0-9583,8364 

0-9533,389057 



lh 

0-0004,404(162 

0-0002,7472 

0-0001,532142 



Mi 

-0-0000,141247 

-0-0000,0345 

-0-0000,009540 



\lh 

-0-0000,141253 

-0-0000,0345 

-0-0000,009535 



lh 

0-0000,012312 

0-0000,0029 

0-0000,000813 



ifh 

0-0000,012309 

0-0000,0029 

0-00(8),000787 



Ih 

2-33 

0-67 

0-25 



lh 

0-4 

3-9 

3-5 


3 

lh' 

0-9471,293171 

0-9172,5148 

0-9075,804094 



lh 

0-0008,673760 

0-001)5,1975 

0-0002,800073 



Mi 

-0-0000,264701 

— 0-0000,0590 

-0-0000,016837 



i/h 

—0-0000,204137 

— 0-0000,0690 

—()•( 8)00,015851 



lh 

0-0000,034381 

0-0000,0090 

0-0000,(8)2570 



,lh 

0-0000,033082 

0-0000,0090 

()•( 8810,002566 



lh 

1-07 

0-25 

(HI 



/4 

4-6 

3-3 

3-1 


4 

ih' 

0-9118,024417 

0-8639,4783 

0-8487,578437 



ih 

0-0013,941771 

(>•0007,9361 

0-0004,295258 



m* 

-0-0000,391427 

—0*0000,0774 

-0-0000,019930 



iMl 

-0-0000,301348 

-0-00(8),0775 

-0-0000,019982 



Ih 

0-0000,072501 

0-0000,0198 

0-0000,006013 



iMl 

0-0000,072501 

0-0000,0198 

0-0000,005630 



Ih 

0-57 

0-12 

0-05 



fh 

3-7 

3-1 

3-0 


15 

ih’ 

0-8682,461741 

0-8002,0843 

0-7701,127138 



!h 

0-00111,748075 

0-0010,5521 

(MKK)5,595144 



lh 

-0-0000,493446 

-0-0000,0829 

-0-0000,020115 



i/h 

-0-0000,493538 

—0-0000,0831 

-0-0(88),020241 



lh 

0-0000,131257 

0-0000,0341 

0-0000,009414 



i/h 

0-0000,131282 

0-0000,0339 

0-0000,009434 



A 

0-32 

0-06 

0-02 



A 

3:4 

3-1 

3-0 


0 

ih' 

0-8172,206960 

0-7282,3688 

0-7013,941584 



Mi 

0-0025,529007 

0-0012,6522 

0-0006,547817 



N 

-0-0000,543573 

-0-0000,0726 

-0-0000,016168 



iMs 

— 0-0000,544455 

-0-0000,0734 

-0-0000,016440 



Ih 

0-0000,206107 

0-0000,0480 

0-0000,012833 



i lh 

0-0000,200344 

0-0000,048) 

0-0000,012845 



Pi 

0-18 

0-03 

0-01 



fit 

3-2 

3-0 

3-0 
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TABLE II ( emiinmt ) 



1 

k 2 
n - 30 


IWMI13.444323 

U-00Ol.4ft!i732 

-.U-0000,0471192 
U'0fHKi,IHU‘.M7 

7'40 
13* 


'/ 2 
A 2 
>i = 311 


1141737.237107 
Otm04.404fW2 
-IMHKHIJ415147 
-IHKKHI, 141233 
(HKHHI,012312 
(HKKHI,Ol23frtl 
2-33 

0'4 


V 1 

i. 2 


n 


■ t 

it !#*»U»l0221 

MlKKK».<*t<t37. f t 

7 Hi 
Ml 


I 


II I I 3*t 


I) ilH4.'*,4Wi?f* 

iiunMl..‘t:ii\HiH 

I HOKUM 13033 4 
IHHKKIJKIl.'»"l> 
l|!»KKM* , |.’»S4 



It will be noted that there are aeetiinm common to the three )mrU «rf Tiilile II. Tin'.*-' have L eu 
included more than once in order to facilitate <'mn[w.riw>iiH in any mu* |*nH of the (mlilr*. 


for which q - 1 when discussing the adequacy of t he Type I approximation <0 the 
distribution of l v 


Consideration of an alternative form of approximation 

For the larger values of n which have been considered it in aeon that the 
distribution of I L is of small dispersion and is situated close to unity, the upper 
limit of possible values of l x . In fact, in many of the eases dealt with, the mean i» 
separated from the start of the curve by some twenty or thirty times the standard 
deviation. In these circumstances it seemed possible Unit u better approxi¬ 
mation might be obtained by taking a Pearson Type I curve in the form 


P(h) 


__ r(m t + m 2 ) 

(1 - b) Wl+ »Ki~i f( mi ) [ \m s ) 


[l x -h) m ' 1 (I for /tsS^ss; 1. ...{HI) 


There are now three assignable parameters; b, defining the start of curve, 
and m 2 , which may be selected by equating the first three moments of the dis¬ 
tribution (19) to those of the true distribution of l x . As before, an idea of the 
degree of approximation may be obtained by comparing the fourth moment of / lt 
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calculated from the mean, with that of the distribution (19). This procedure has 
been carried out in a few cases, the results obtained being set out in the upper 
part of Table III. 

As far as can be judged by a comparison of moments, there is little difference 
between the approximations obtained by using a Type I curve in form (7), 
which is fitted by two momenta, and the Type I curve in form (19), which is 
fitted by three moments. However, as we are primarily concerned with the 
probability levels of the distribution of l v the effect on these levels of the change 
in the method of approximation must be ascertained. 


TABLE III 

Comparison of distributions fitted by two and three moments 



n = 30 

a = 30 

n = 30 

n — 30 

n = 30 

n = 30 


fc = 5 

jfc = r> 

k = 0 


k = 6 

k = 5 


7 = 1 

<1 = 2 

7 = 3 

7 = 4 

7 = 6 

7 = 6 

/ Of true distribution 

6-14 

20 

00 

108 

341 

480 

/( 4 x Kb j Of distribution (7) 

6-OK 

20 

00 

108 

330 

481 

(.Of distribution (JO) 

fi-OK 

20 

oo 

108 

330 

480 

r.if i, >VI> i /Of distribution (7) 

“ lcvu \()f distribution (lit) 

0-1)074 

0-0280 

0-87(17 

0-8140 

0-7446 

0-6678 

0-1)074 

0-0280 

0-8767 

0-8149 

0-7446 

0-6677 

!«’ / Of distribution (7) 

“ (Of distribution (10) 

0-0040 

0-0110 

0-8663 

0-7916 

0-7102 

0-6411 

0-0040 

0-0110 

0-8662 

11-7916 

0-7102 

0-6413 


The method of obtaining the 5 % and 1 % levels when the distribution is in 
form (7) has been given. When the second approximation is considered, the 
transformation 7 _ /, 


gives the distribution of t as 

so that, as before, the tables of z may be utilized in calculating the levels. The 
lower part of Table Ilf compares the levels given by the two methods of approxi¬ 
mation. 

It is seen that, for all practical purposes, the probability levels as given by 
the two forms of Type I curve may be taken as identical, so that nothing is to be 
gained by using the more complicated distribution (19). Furthermore, the agree¬ 
ment obtained strengthens the conviction that the Type I curve in form (7) 
really provides a good approximation to the true distribution. From now on¬ 
ward, therefore, we shall only be concerned with the approximate distribution 
given by (7), 
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6. Direct comparisons in cases where the 
true DISTRIBUTION OK /] is known 


When only two groups are considered the distribution of f is known in the 
cages of one and two variates. These known distributions may la* used to test the 
accuracy of the probability levels as obtained by using the Iv|h‘ I curve, If 
q = l we have seen that l\ - L t . P. I*. N. Nayer (lH3fl, p. 411} has shown tlmt, 
when k - 2, 


2" 3 [V 


j /,{ l.fnHK/.,- 1, 


i) niy 

no», J'U.nw -—rnrue „ '■< 1 

“"■'t t)| 


Hence, /’{I, <I?} - /,(!(» - i). where x « (I?)*.t~U 

When q » 2 and k = 2, Pearson & Wilks (15133) have proved that 

jDO 70) _ (" ?l ” i) _ j (Shin 4 ' + V) * (A I 1 ' I lif f n , .» 1 ] 

? {A< Ai - r (w _ iji^ _2) 2*«- *» r 1 ' gf l i“ If 7 * ** 

where x - (11)*, .{22} 


Thus if k « 2, relations (21) and (22) enable the probability integrals of ij {*/ l) 

and l x (q ~ 2) to be obtained by using Tables of Ik InemnplHt IhUt Function. 
In this manner the true probability associated with each of the 5% and 1 
limits, obtained by using the Type I approximation, may be calculated. Result* 
obtained are given in Table IV. 

For the values of n considered it is seen, from Table IV, that the true prob¬ 
abilities are very close to the desired values 0*05 and 0*01. It should bn pointed 
out that, as is shown in Table II, the six distributions considered above are 
unusually skew and leptokurtic compared with the remainder of the distri¬ 
butions, and moreover the agreement of true and approximate distribution, ax 
judged by the moments, is not noticeably better for these curves than for the 
curves in general. Hence, for distributions which, judging from the value* of 
k a»d k> m certainly not in any way favoured, the limit* «d by using the 
approximation are found to give true probabilities near enough to the desired 
values for all practical purposes. 

To summarize the salient points in the preceding sections, we have seen that; 

(a) For a wide range of distributions close agreement between the third and 
fourth moments of the true and approximate distributions is obtained. 

(b) Type I curves in form (19) fitted by three momenta, which might be 
expected to give a better approximation, lead to probability levels which are for 
practical purposes the same as those given by the Type 1 curve fitted by two 
moments. 
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(c) In cases where the true distribution is known, the limits set by the approxi¬ 
mation give values of the true probability near to those desired, 

We may therefore say with some confidence that, having regard to the above 
considerations, the use of the Type I curve in form (7) as an approximation to 
the distribution of l ± seems to be amply justified. 


TABLE IV 




7 = 1 

7=1 



& = 2 

k = 2 



n = 80 

n = 50 

Typo I 5% limit 

True probability 

0-H936 

0-9009 

0*9804 

0-05005 

0-05908 

0-04050 

Type 11% limit 
True probability 

0-8235 

0-9435 

0-9064 

0-00998 

0-01901 

0-01002 


7 = 2 

7 = 2 

7 = 2 


fc = 2 

k ss 2 

k = 2 


n = 10 

n = 31) 

n = 50 

Type t 5 % limit 
True probability 

0-7814 

0*9324 

0-9090 

0-05005 

0-04997 

0-04979 

Type T 1 % limit 

0-0989 

0-9O34 

0*9425 

True probability 

0-00999 

o-oomm 

0*00908 


0. The limiting form of the distribution of 

As was the case in the single variate problem (Ncyman & Pearson, 1931), 
the distribution of l Y in large samples may be obtained approximately from that 
of y 2 . M>,(/!).), the Ath moment of the sampling distribution of A! = If, may be 
obtained from (0) by replacing h by Nh. Using Stirling's approximation, 

!'(x)~^l(2n) x x i r ■ r , 

it may readily be shown that M h {\ { ) tends uniformly to (1 + h)" u as n tends to oo, 
for all h > 0, where 

1)7(7 +1). .(23) 

The distribution of y 2 with / degrees of freedom is 

p(y a ) = {21/ /’(A/)}” 1 (y 2 )!/™ 1 ?,”!/:*, for (Ky^co.(24) 

If we put y - e-iV (ho that 0 < y < 1), the Ath moment of y is 
ri f o> 

M h (y) = y h p(y) dy = {21/ i\\f)}-' (y 2 )!/" 1 e-W+» d(y 2 ) 

Jo Jo 

= (1+A)-1/. 
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Thus the corresponding moments of the distribution of y and the limiting form 
of the distribution of A, are equal. The range of possible values nf the variables 
being finite, i.o. from zero to unity, the equality of moments is mifliwnt t« ensure 
that the distribution of y and the limiting distribution sd A, are identical 
Therefore, in largo .samples, -2 log,A, = - 2.Y log,/, will be ajqtntximatHv 
distributed as y 2 with/ degrees of freedom. Thus if 

ijafjWA’, .(2ft) 

it follows that the probability levels for I, nmv he obtained by inserting in (2ft) 
the corresponding levels for y*. It had been hoped that this method of rslni- 
lating the probability levels might prove satisfactory for moderately large* values 
of n, so that the labour of calculating the first two momenta of I, could have been 
avoided. In actual fact, however, this hope is not realized, as is shown by the 
results given in Table V. 


TABLE V 


(1) Level from Type. I 

(2) Level from X s 


Difference (2)- 

by /(/*«) 


(1) divided 


(1) Level from Type I 

(2) Level from y ! 
Difference (2)—(1) divided 

by /W 

3 | (1) Level from Type I 
(2) Level from y 2 
Difference (2) - (1) divided 

by /W 


(1) Level from Typo 1 

(2) Level from y 2 
Difference (2)-(l) divided 

b y V(/h) 


A-= 2 
»*30 


r,n.' i «/ 

*’ .11 1 Ml 

11*1)11(19 0-1HD5 
0*0685 0*94(12 
0*13 0*22 


0*8913 0*8570 
0*9004 0 
0*31 0*40 


(1) Level from Typo I 

(2) Level from y 2 
Difference (2)-{l) divided 

by VW 


(1) Level from Type I 

(2) Level from y 2 
Difference (2) - (1) divided 

by VW 


a 

30 


0*7884 (1*7476 
0*8119 0*7750 
0*53 0*62 


f,(i* in 

0*9074 0*11540 
0*11089 0*9507 
0*15 0*22 


11*0281) 0*911(1 
0*9323 0*9103 
0*20 0*32 


0*8707 0*8503 
0*8857 0*80115 
0*3(1 0*45 


0*814(1 0*71115 
0*8304 0-8087 
0*55 (Mil 


0*7446 0*7192 
0*7684 0*7448 
0*73 0*79 


0*6678 0*6411 
0*7014 0*6769 
0*94 1*01 


1* 5 

» . * 411 


A 

Mi 


V 


11*11757 0*18102 : IHMIi l» 1(731 
11*9700 0*1)673 ; 0 9812 IHI73<* 
0*12 o*15 ' IM» i*12 


i jo j 

*i30 •! 


01(709 0*9029 ; 
" 9722 O 904.’, 
ola i<22 : 


0*9403 0*9330 0*9,173 M 947M i 
0*1)488 0*9300 1 (H15H8 n nm j 
0*20 0*24 0*15 «*!9 j 


048179 0*8922 < 0*9205 tM»ISH IW783 IIKM3 
0*9130 0*8(181 <1*9298 H*817H O8H07 (487,30 
0*30 0*31 10*24 (1-28 Su-50 Ofa 


0*8910 0 * 8*128 j 0*8887 118*37 
0*8(19!) 0*8528 S 0*8945 II MMM S 
0*41 047 ! 0 33 0 3k I 


0*8007 0*7807 
0-8206 0-8018 
0*55 0*60 


0*7462 (1-7244 
0*7665 0*7463 
0*72 0*78 


0*8446 t»*82«! 

mm U H380■; 

0*44 0*48 


0*7935 0*77815 
0*80.83 0*7313 i 
0*04 0*05 
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It. is evident, from inspection of 'fable V that, as would be expected, the 
agreement between the levels, as given by the Type I and y 2 approximations, 
improves with increase of n, whereas it slowly becomes worse as k is increased. 
However, as q increases there is such a rapid deterioration in the accuracy of the 
approximation that this method cannot be justifiably employed, except for 
large values of n. 

It may however be usefully noted that the levels obtained from the y 2 trans¬ 
formation are always above those set by the Type I curve, so that if the hypothesis 
is not rejected when the level given by y 2 is employed it certainly would not be 
rejected if the more accurate Type I level had been calculated. 


7. Empirical relations for m x and m 2 
When using the test it is found that the calculation of the first and second 
moments of l x accounts for the major part of the labour of computation which 
is involved. It therefore seemed desirable to attempt to obtain empirical relations 
giving m x and m i in terms of n, k and q, such that the use of these values will 
lead to probability levels of l x sufficiently accurate for practical purposes. 

We have seen that M h (X x ) tends uniformly to (1 +/i)*'/ as n tends to co. 
Put l[ - Ai / * ,, y where M is any positive number. Then M h (l[), the Ath moment of 
l[ tends uniformly to (1 + A/ikf) ' */ as N ~> co. If a Type I curve be fitted to the 
distribution of l[, in a way similar to the one employed when approximating to the 
distribution of l u the exponent of the power of (1 -l[) will be given by 

_ H-MS'ylUM.d'O-MSi)) 

! - ' . ’ 

and this may be considered as a function i//(Af, JV), of M and N. 

Now, 


lim i J/(M, N) = lim m 2 

N->oq iV->m 


{1-(1 + 1/M )--»/}{(! + I/M)-l /-(l + 2/ilf)-</ } 

(i + 2/jtf)-«/-(i + i/jfy h 


the limit being approached uniformly for all positive M, and lim <j{M) = I/. 


Hence lim tfr(N, N) - lim m 2 If = 0*25 (k- 1) q(q+ 1). 

A’- 


i 


Values of m x and m, which have been calculated in a number of cases are 
given in Table VI. The values of m 2 indicate that the limit A/is approached rapidly 
with increase of n t and it, seems probable that, for % £ 20, no error of practical 
importance will be made if we substitute \ J for m t . 

Methods of approximating to m x must now he considered. U. S. Nair (1938, 
p. 285) has shown that the distribution of l x may be reduced to the form 

Pih) — Piih) VS i)> 

where p 1 (Z l ) = const 1 


and 


vSi) 


i 

ini 


‘l>» q 


mg-j+m* 


■i)+i -j+t)) 


dt, 
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TABLK VI 



Value# of m x a 

id Wa 

.. 



7“ 1 

n <> ‘ 

V 3 

7 !; 4 

V. 5 j 

- f 

? - ■ « 

k = 2 

17-351 

15-500 ' 

i 




n- 10 

0-501 

1-501 i 

j 


j 


k~ 2 

57-282 

55-588 ’ 

53-732 

5Hl*3 

40432 ] 

4« 5*07 

n = 3(l 

0-500 

1-500 j 

2-000 

-H.HW 

T.Vfl | 

In .711 

t~ 2 

07-260 

115-880 



! 

i 


n- 50 

0-500 

1-500 i 
! 

..._4 


1 

. . i 


k~r } 

42-007 

.j 

38-210 ! 

l 

38-111 ; 

27 527 

21 TOO 

111 Its 

n = It) 

2-005 

(1-004 

12-dlK 

20-180 

31-022 ' 

45 1553 

/fee 5 

02-501 

88-188 

83-053 

77-KHt 

J 

TM j 

63012 

n = 20 

2-001 

tt-(KK) 

12-064 

20-030 

< 

30 177 

: 

42 >dKt 

k = 5 

142-550 

138-177 

133-033 

127-1814 

128*430 

113-187 

n = 30 

2-000 

(1-0(81 

12-001 

20-014 

■ 

42 230 

k = 5 

192-543 

188-178 

183-014 

177-043 

170-2M5 

M2 »3» 

n « 4(1 

2-000 

«-(KKI 

12-(KHI 

2» mis 

30038 

42 127 

fc = 6 

242-532 

238-175 

233-020 

227-017 

220-101 

aia-w«M 

n = fiO 

2-000 

6-000 

12-001 

20-005 

30-023 

42 075 

**=10 

284-089 

275-830 

205-205 

252-811 

238-8(51 

223(46* 

n = 30 

4-801 

13-501 

27-000 

45-0411 

(17-710 

05 21 ( 

= 25 

711-207 

088-001 





n = 3() 

12-004 

_1_ __ 

30-007 



i 



In each coll i« the upper and m t the lower number. 


so that p t {l x ) is independent of n. Now' if |/replaces w |t the Typo I dbtribution 
(7) takes the form 

p(l j.) - const \ 

so that it is expressed as the product of two factors Ip 1 end (1 -l i V > l » the 
second of which is independent of n. In these circumstances it seemed jHwsibte 
that m 1 might perhaps be replaced by k(n-q). Therefore putting 

m x ~ k(n~q) 

% = if = 0-25(&-1 )q(q+ 1), 


and 


....,,( 20 ) 
...,.( 10 ) (m 
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the 5 % and I % levels of l t were calculated in a number of cases and compared 
with those given by using the values of m l and m i obtained from the moments. 
The results given in Table VII indicate that, although the agreement in cases 
where q- 1 is satisfactory, for larger values of q the discrepancies become too 
large to enable this method of approximation to be used in practice. 

TABLE VII 

Comparison of levels given by Method 1 and Method 2 





i 

■ 


<1. 

= 5 


Method 

«% 

w 

5% 

1% 

r. 0/ 

'> A) 

1% 

k = 2 

1 

0-907 

0-944 

0-801 

0-858 

0-788 

0-748 

n = 30 

2 

0-007 

0-944 

0-892 

0-858 

0-701 

0-750 

k~b 

i 

0-800 

(1-857 

0-021 

0-570 

0-304 

0-203 

10 

2 

0-001 

(1-80-4 

0-035 

(1-585 

0-340 

0-304 

k = 5 

1 

0*050 

(1-031 

0-814 

0-784 

0-024 

CHI 

n = 20 

2 

0-052 

(1-033 

0-818 

0-788 

0-040 

0-007 

k = r> 

1 

0-007 

0-055 

0-877 

0-850 

0-745 

0-710 

n = 30 

2 

0-000 

0-955 

0-878 

0-858 

0-752 

0-728 

k = 5 

1 

0-070 

0-000 

0-008 

0-802 

0-807 

0-787 

n = 40 

2 

0-070 

0-007 

0-000 

0-803 

0-811 

0-792 

k - 5 

1 

(1-081 

0-073 

0-027 

0-914 

0-845 

0-828 

it » no 


0-981 


0-927 

0-015 

0-848 

0-831 

It ~ 10 


Wm 


1 

0-804 



n « 30 


■ 

■9 

■jjgj» 

0-800 




Method 1, m, and m„ obtained from the first two moments oll v 
Method 2. m, and m, t given by tilt* empirical relations 
»»i - Kn-q), m % = 0-2S(A-1) q(q+ 1). 


It is clear that the deviations are due almost entirely to an inadequate 
approximation to m lt although the expression does not appear to differ from m l 
in such a marked manner that its use as a basis of approximation must be rejected 
entirely. Thus it was thought that some slight correcting term might be sub¬ 
tracted from k(n—q) to provide a better approximation to m v With this end in 
view values of the necessary correction 0 = k{n~q)-m x were calculated and 
are given in Table VIII, 

Biometrika xx.xt .( 
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TABLE VIII 

Vufmw of correction <" k(n-ql ”h 



q ■- 1 

q ~ 2 

k = 2, n = 10 

0-115 

0 - 4(1 

k = 2, n = 30 

0-72 

11-41 

«: = 2,n = 50 

0-73 

(HI 

k «= fi, ii =10 

2-3(1 

i-72 

k « 5, « a 2(1 

2-41 

Hd 

k = 5, n = 30 

2-44 

past 

k «r».». o 40 

2-4(1 

1-K2 

k » 0, n - 50 

2-47 

1-02 

k « 1(1, it - 3(1 

5-31 

4-10 

k a 25), H s 3(1 

13-70 

11-10 

1 

i_ 


| 


q 3 

7 '• 4 

v - 

1 f - o 

(1-27 

)( 32 

o 57 

} )*.t 


4 . 


s 

1-HO 

1 #-'Yi 

3 23 

■ 3 *2 

1-05 

1 i»-Mi 

( 21 

f, no 

1-07 

I hn 

4 00 

i. *1 

1 -00 


4 71 

7 17 

l-0s 

\ 

4 S? 

? in 

4-70 

i 

» 7 10 

11 II 

m 55 


When 20, the variation of V with n. Fur fixed k and q. n* »iu«H ?*>< that m 
seeking a simple empirical relation for (\ function* of k and q only weir cun 
siderecl. If, for constant </, (' he plotted ugiiiu*t k. it i« found ibid flu* rrhitimi 
between them is nearly linear and, moreover, (' may he taken m proportioned to 
(fc-l). Assuming therefore that (■ « (k- values of a r) for different >/ may 
be calculated. On plotting f) n against q, the form of the resulting curve suggests 
that, over the range of values of q which are considered, W v will he given with 
sufficient accuracy by a quadratic form in q. Making this assumption the eoefft 
eientsofthe quadratic were adjusted hy trial and error until the expression which 
resulted seemed likely to give satisfactory results in most eases,, The correct ing 
term was thus tentatively obtained in the form 

(J = Odd (k- 1}(«()- :«!(/»• fk/*). 

The final replacement for m,, as previously given, is therefore 

% » k(n-q)~lMtt{k~ l) (Ml- fitly + fly*).f J f>) Mm 

and as before m 8 « 0-25(fc— l) q(q p 1), .(to) hut 

It is realized that the reader may have some doubts as to the validity of the 
empirical methods by which this approximation for m, was obtained. In order 
therefore to establish this approximation as a satisfactory practical method, it 
is necessary to calculate for a wide range of cases the 5 % and 1 % levels given by 
the approximation. These may then be compared with the levels obtained when 
values of m x and m 2 calculated from the moments are used. This has been done in 
a number of cases, tbe results being given in Table IX, 

_ Inspection of this table shows that when n -> 2b the levels obtained by 
using values of % and m a as given by (15) and (18) agree with those resulting 
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TABLE IX 


Comparison of levels given by Method I and: Method 3 





ff = 

1 

? = 

2 

V = 

•3 


4 

1 = 

= 5 

'/ = 

= 6 

k 

n 

Method 

5% 

1% 

6% 

1% 

5% 

1% 

a 

1% 

5% 

1% 

5% 

I 0/ 
x /0 

2 



0-894 

0-824 

0-781 

0-699 

_ 

_ 



jm 

1 


__ 




0-894 

0-824 

0-780 

0-698 

— 

-- 

— 

— 

— 


— 

— 

2 

30 

l 

0-967 


0-932 

0-903 

0-891 

0-858 

0-843 

0-805 

0-788 

0-748 

0-728 

0-685 



3 

0-967 

0-944 

0-932 

0-903 

0-891 

0-857 

0-842 

0-804 

0-786 

0-745 

0-725 

0-681 

2 

50 

1 

0-980 

0-066 

0-960 

0-943 

— 

— 

-- 

— 

— 

—. 

— 

— 



3 

0-980 

0-966 

0-960 

0-943 

— 

— 






"" 

6 

10 

1 

0-800 

0-857 

0-772 

0-724 

0-621 

0-570 

0-458 

0-409 

0-304* 

0-263* 

0-174* 

0-145* 



3 

0-89(1 

0-857 

0-771 

0-723 

0-019 

0-567 

0-454 

0-405 

0-292 

0-251 

0-146 

0-118 

5 

20 

1 

0-950 

0-931 

0-891 

0-865 

0-814 

0-784 

0-724 


0-624 

0-690 

0-521 

0-488 



3 

0-950 

0-931 

0-890 

0-805 

0-814 

0-784 

0-723 

0-6IK) 

0-624 

0-590 

0-520 


r, 

30 

1 

0-967 

0-955 

0-928 

0-911 

0-877 

0-856 

0-815 

0-792 

0-745 

0-719 

0-668 

0-641 



3 

0-967 

0-955 

0-928 

0-911 

0-K77 

0-850 

0-815 

0-791 

0-745 

0-719 

0-668 

0-041 

10 

30 

1 

0-971 

0-963 


0-920 

0-878 

0-864 

0-813 

0-797 



_ 




3 

0-971 

0-963 

0*931 

0-920 

0-878 

0-804 

0-813 

0-797 

—. 

— 

- -’ 


25 

30 

1 

0-975 

0*970 

0*936 



..... 




.... 





3 

0-975 

0-970 

|gg 

0-929 

.. «y 


,..,u 

* ■*" 




— 


Method 1. m t and rn t obtained from the first two moments of Z t . 
Method 3, m i and m t given by the empirical relations 
»t, » k{n - ?) - Milk - 1) (90 - 39? + 9g 5 ), m 4 -- 0-25(fc-1) 1). 


front the use of values of wtj and m 2 calculated from the moments with sufficient 
accuracy for practical purposes. Even when n is as small as 10 the only serious 
discrepancies, marked with an asterisk, occur when there are as many as live or 
six variables. 

The range of values of m t and m a of the distributions considered in the last 
table may be found by consulting Table VI. From the practical viewpoint, it 
must be noted that when both % and m 2 exceed 60 it becomes impossible to 
interpolate in the z-tables without running the risk of making appreciable errors. 
In these cases an approximation, due to Fisher (19386, § 41), may be used. When 
z has degrees of freedom f x and/ s which are both large and/ = 2/ 1 / J /(/ 1 +/ J ) is 
their harmonic mean, the 5 % level of z is given approximately by 


1-6449 

_ 2-3263 

2 '« “/(/- 1 )' 


•0-7843 


1-235 


h 

(K 


and the 1 % level by 
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8. Some konst hkhation ok tub cask ok rs *:<<!*,u. 

Wo have ho far considered k equal -samples eswh »i w«‘ a. but in Hu* genml 
case there may he Samples of sizes »,(£ - I, -.*)• The appmpml* rritwion 


in this case is 


//; /! zw|r ,?v i 


1 "* 

“ (•* «(i“~ * 

where now 

I'mit ” „ (■* .Mji 

n H - 1 


_ 1 "1 . 

■hi. „ 2 j '* rJi'- 

k 

and 

X » v « { , 


tin t 

i i 


It is seen that l\ is the ratio of a weighted gromefrir mean of fin* generalized 
variances to a determinant of order q in which the element in the atl* row and 
uth column is v m , the weighted arithmetic mean of ... «„ 4 , the weight 

given in each ease being the corresponding sample size. When eutistdering single 
variate criteria, Bartlett(I5J3?) suggested that a heller test wdlWHblmnrdjf each 
variance is weighted with the number of degrees of freedom it p* rather than 

with the sample size, and more recent unpublished work eunlhrm» this *>intention. 
Thus there is a possibility that in the general ease, where more than one variate 
is considered, some adjustment of the weighting will yield n criterion win* h will 
more frequently detect the falsehood of the hypothesis //„ when m f«et «»me 
alternative hypothesis is true. That is to say, the modified test might prove t o he 
more powerful, in the sense of Nay man & Pennon [ 1H3II), with regard ton wrtain 
set of alternative hypotheses. However, it is likely that such a un«hii*-a!ion will 
only have an appreciable effect when some, at least, of llu* samples are smalt. 

The preceding reasoning concerning the limiting form of the distribution of 
lx may lie shown to apply whether the samples are of the stunt* size or not, The 
distribution of l lt as approximated to by means of the x 2 transformation. de}*ends 
only on N, Ic and q, and not on the size of individual samples. It therefore mnm 
possible that with large samples, which although unequal do not vary greatly in 
size, a reasonable approximation to the true levels may 1 m? obtained by wing t he 
X z transformation of equation (25). 

The accuracy of the ,y a method of approximation may hr improved by 
following a procedure similar to that used by Bartlett (Mil) when Binding with 
a single variate. In the general case, the hlh moment of A, is given by 


*»(Ai)-n 

i=i 


rMl + k)-i) 

NW'»i * l * 2 


11 f?) 


<i 

■ 11 
(»* t 


r 


J A ““ k + i \ i 

-4' 


pi N( 1 + A} — k ^ t — i, 

l \ . ' 2 


In the limit -21og 0 Ai is distributed as with / degrees of freedom and our 
purpose is to find a correcting factor 0, a function of n„ k and q which only 
involves terms of order K)' 1 , such that -2G' Mog f A 1 has a moan value differing 
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from /, the mean value of y 2 , by terms of order {n t )~ 2 , Thus Gf must be identical 
with the sum of terms of order ( n j)" 1 in the coefficient of h in the expansion, in 
powers of h, of log,, {A/L^/At)}. Using Stirling’s approximation we obtain 


0 = 



. n < 


*,/(* + »-1) 2 k + i- 1. N 

if i \ r 2 N + N-k+l-i 3 (N-k +1 -if, 


If now - 2G-'" 1 log,, Aj be referred to the y 2 distribution with/ degrees of freedom, 
approximate levels for l x may be obtained from the corresponding levels for y 2 
by using the relation j G\f\ 


k = exp 




2 N 


.(27) 


The degree of accuracy provided by this method is indicated in Table X, which 
compares, for some typical cases, the levels given by (27) with those obtained by 
fitting a Type I curve, 

TABLE X 



7=1 

7 = 2 

7 = 3 

7 = 4 

7=3 

q n 0 

k ~ 5 
n,- 10 
(/=],.. 

.r») 

kiv , |tvri fFtt#n Type I 
' 1 From relation (27) 

(>•800 

0-800 

0-772 

0-774 

0-021 

0-028 

0-458 

0-473 

0-304 

0-328 

0-174 

0-200 



‘sHJKKU 

(1-8.77 

11-807 

0-724 

0-72(1 

0-570 

0-577 

0-400 

0-425 

0-203 

0-288 

0-14.7 

0-170 

k -5 
n, - 20 
(/=],.. 

„ 5) 


0-9.70 

0-9.70 

0-801 

0-800 

0-814 

0-813 

0-724 

0-724 

0-024 

0-028 

0-521 

0-528 



uiH!Sa&, (27) 

0-9.11 

0-031 

0-80,7 

0-80(1 

0-784 

0-783 

0-091 

0-092 

0-590 

0-594 

(1-488 

0-495 


It is apparent from Tables V and X that the crude y 4 approximation has been 
considerably improved by the use of the correcting factor 0. Although examples 
in which the Ic samples are of unequal sizes have not been given in Table X, it is 
probable that, if the smallest sample contains at least twenty individuals, the 
application of the above method will give a fairly satisfactory result, 

Mention may be made in passing of a similar approximation which is found 
to give close agreement with the distribution of the comprehensive likelihood- 
ratio criterion for testing one of the hypotheses concerning the values of the means 
of populations referred to in the introduction to this paper. It is assumed that 
the hypothesis //„ regarding the equality of variances and covariances is true, 
and it is desired to test whether the corresponding means of the q characters are 
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the same in all k populations. In the case of two samples (/; 2). the test is 

equivalent to use of Hotelling’s generalized 7'. It 

1 k lit 

M„H ~ V E H (• r nli~ X ».) (‘til 
jV t 1 { I 


] k n, 

where — «r jL !L tiim 

n (-■ l i i 


the appropriate comprehensive likelihood criterion was given hy Wilkr 


in tilt* 


form 



The distribution of If may be expressed simply in the following three eases; 


(a) When q ~ 1, 

(b) When q = 2, 

(c) When k = 2, 


p(W) « Jb( a ’ k ~ l )j ‘ ir !,v 4 a 11.in 1 ** * 

p( s IW) = {m-k-\,k~\)l >M> V * % H’|* 2 - 

pod« [»( iV 7~ I “I)! 1 wiix *"'o* 


In other cases, following the previous procedure, it may be shown that approxi¬ 
mate levels for W may be obtained by vising the relation 


ir 



where 


(?'=! + 


1 VI 


(i(Jfe-2) jV(3(*r—1)* — 2) A- +1 - 1 

iw -T + 6(W - i)»" + ;v-T+1 -~ 


*v i 

+ 3(A’ - A- + 1 » i)*j • 


and x 2 has 1) degrees of freedom. Calculations which have been made for 
certain examples show that this method of approximation will give the levels of 
W with sufficient accuracy for most practical purposes if the ratio S;k is not 
less than 10. 


Summary 

The main object of this paper has been to put into easier form for application 
the likelihood ratio criterion l x for testing the hypothesis that all the corresponding 
variances and covariances in a number, k, of multivariate variable normal 
populations are identical. In the case where the samples from each population 
are of the same size {n t = n, t = I,..., k) empirical formulae have been given, 
using which 5 % and 1 % probability levels for 1, may be obtained readily from 
the levels of R. A. Fisher’s z; alternatively the Tables of IM Incomplete Bela • 

When q = 1, W is just the ratio of two sums of squares, familiar in the analysts of variance. 
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Function may be used. The conditions under which these formulae may be 
safely employed are discussed. 

k 

If the samples are large and N = •~2A r log ( J 1 is distributed approxi* 

i 1 

mately as with degrees of freedom /= £(/:-!) q(q+\). The test is then in¬ 
dependent of the individual values of n h which may now differ, If there are many 
variables the sample sizes must, however, be very large for this transformation 
to be justifiable. 

A more accurate approximation, of the type suggested by M. S. Bartlett, in 
the case of a single variable (q = 1), has also been considered, The corrective 
term is, however, rather troubles ome to calculate unless again n, - n{l= 1 

In conclusion I wish to thank Prof. IS. S. Pearson for much assistance in the 
preparation of this paper. 

Additional note 

U. 8. Nair has recently obtained, on a theoretical basis, an approximation to 
the distribution of l v in the case of k equal samples, somewhat similar to the one 
given as Method 11 on pp. BO-fil. 

He finds that the distribution of l x may be taken approximately as 
p(l t ) = const -i (i 

where O' - ~(fr-l) (Ihf- l3?+28 )-^ + + \y 

As far as we have been able to check, this approximation and Method 3 seem to 
be adequate for the same values of n and appear to give levels which are sub* 
stantially the same. 
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OBSERVED AND THEORETICAL RATION IN 
MENDRUAN INHERITANCE 


By E. ROBERTS. W. M. DAWSON* am> 


MAHOAHKT MAUDKN* 


Vmremty of Illinois 


According to the present conception of Mendeliiut inheritance gene* are hu-ated 
on chromosomes, ami if not located on the same chromosome art* mdeja'ietent 
in inheritance. During t he formation of reproductive celts the number i ff cbmum 
aomes is reduced to one-half the number common to the ujieeiea or out* half the 
number found in the somatic cells. As a result of fertilizat ion t he original number 
is restored. 

An allelomorphic pair of genes are genes occupying the siune relative loci on 
homologous, chromosomes, one of which came from the father and the other 
from the mother. The members (alleles) of an allelomorphic pair muter usual 
conditions separate, going into different reproductive cells or gamete* When 
located on different chromosomes the result is as many different kind* of germ 
cells as the number of possible combinations of the genes, except that member* 
of an allelomorphic pair are not found in the same gamete. For any number iff 
allelomorphic pairs the number of different kinds of gametes is 2". in which n is 
the number of allelomorphic pairs. In case of dominance and massiveness the 
number of visible classes or phenotypes formed among the offspring m tin* result 
of random fertilization is also 2". 

The purpose of this paper is to put on record the ratios obtained in several 
mammalian crosses made in genetic studies in the Laboratory of Animal l tenet te*. 
College of Agriculture, University of Illinois, during a jierind of several veam. 

Daily inspection for birth of young was made and number of young recorded. 
Usually two classifications were made, one at 12- 1H days of age and the second 
at the time of weaning, which in the ease of the mice and rats was at. about 2K days. 
Rabbits were weaned at an age of about 8 weeks. Colours and colour patterns 
can be easily classified as soon as the pelage is well formed. Dark eye and pink eye 
are distinguishable at birth and were recorded when the birth records were made, 
In hypotrichotie rata the hair is normal in appearance until between }« and 
20 days of age when deflation begins, iirst noticeable around the head. 

In our records litters were classified as complete and incomplete. A complete 
Utter is one in which ail the young were classified and an incomplete Utter one 
in which one or mo T e had died before classification. Ratios among incomplete 
litters did not differ significantly from ratios among complete litters, ami for 
this reason they are combined in the tables. 

* Formerly Assistant in Animal Husbandly. 
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The symbols used and the characters which they designate are as follows: 

Rats 

A=agouti, a=non-agouti. 

H ™self colour (entirely coloured), h — hooded. 

R=dark eye, r=red eye. 

C=colour, c=albinism. 

H y = haired, h y = hypotrichosis (hairless). 

D= intense colour, d= dilute colour. 

Mice 

A=agouti, a ~non-agouti, 

P, = dark eye, p x =pink eye. 

P 2 = dark eye, Pa—pink eye (a second gene causing pink eye). 

B = black, b = brown. 

Y—yellow, y = non-yellow (black or brown). 


Rabbits 

A = agouti, a=non-agouti, 

E~ extension, e=non-extension (yellow). 

For obtaining probabilities of ratios with two classes the formula 

# = +<) mV(?w), 


in which n is the number of individuals, p the observed proportion of one class 
and f/ the proportion of the other class, was used. For ratios having three or more 
terms, the fit of the observed to the theoretical ratio was determined by Pearson’s 


formula 


A -‘T »», r 


ni[ is the observed number in a class, m r the theoretical or expected mnnber, and 
8 is the sum. V is obtained from Pearson’s Talks Jar Statlstmans and Bio- 
melncians. 
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2. Matings of the type Aa x Aa, producing two phenotypes in the ratio of 
3 dominants to 1 recessive. 


Aa -> A + a 
. . gametes. 

Aa-»A + a ” 

1 AA + 2Aa + laa zygotes. 
Since A is dominant a ratio of 3:1 is expected, 


TABLE If 


Mating 

Dominant 

Rea-naive 

Deviation 

Probable 

error 

1)JE 

lints 






Aa x Aa 0 

1107 

185 

13-00 

8-03 

1-02 

E 

m 

108 




HhxHh 0 

1122 

mm 

14-75 

8-70 

1-70 

E 

030-76 





RrxRr 0 

38!) 


1-25 

0-02 

0-19 

E 

387-7(5 

■S 




Cc x Cc 0 

04(1 

324 


10-47 

0-82 

E 

9r»2-, r ) 

317-5 




Hyhy X Hylly 0 

130(5 

435 

0-00 

12-18 

0-00 

E 

1305 

435 




Del x Dd 0 

<105 

204 

1-75 

8-33 

0-21 

E 

(100-75 

202-25 




Mite. 






P,p,xP,p, 0 

471 

450 

141 

153 

12-00 

7-03 

1:71 

habbila 






Ee x Ee 0 

125 

33 

li-50 

3-40 

1-88 

E 

118-5 

30-5 

... . . . .. 
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3. Matings of the type AaBbxaabb resulting in a tlienretiraS ratio of 
1: 1: 1 : I. 

TABLE III 


Matings 


Hals 

AuHyhy X uahyhy 0 
10 


Hhliylly X hhhyhy 0 

K 


RrHyhy x rrhyhy 0 

i? 


DdHyhy X ddtlyhy () 
10 


CcIIyhy X cehyhy 0 
10 


AaDdxuadd 0 

10 


HhDd x hhdd 0 

10 


RrDd x rrdd 0 

E 


AaRr x aarr 0 

10 


HhUr x hhrr 0 

E 


AaHhxaahh 0 

10 

Mice 

AaP^j x aap,p 1 0 

E 


AaBb x aabb 0 

E 


BbP lPl x bb PlPl 0 

E 




Ally 

Ally 

ally 

137 

175 

1 05 

17f,-2a S 



HHy ! 

Hily 

HH r 

UM 

187 

H r, . 

171-25 

J 

1 

.-I 

HHy 

Rlly 

rHy j 

MM I 
M7 | 

1.71 

ha ; 

DHy 

Dhy 

dll, 

lr.H 

lr.i-r. 

urn 

HI ! 

Oily 

Chy 

dly j 

13 

i:m 

13 

10 

At) 

Ad 

a I) 

IHI 

I7i>-7. r > 

Ill-l 

1x7 

HD 

Hd 

lit) 

1«7 

1411 

133 

ISO'S 


1 

RD 

Rd 

r» 

184 

10(1 

i75 ; 

17(1 



AR 

Ar 

aR 

ISO 

140-5 

140 

147 

HR 

Hr 

hr 

1(17 

147-75 

ISO 

134 

AH 

Ah 

all 

175 

1S5-S 

130 

UH 

: 

AP, 

Ap. 

aP. 

937 

933-5 

909 

9(11 

AB 

Ab 

aB 

935 

933-5 

971 

907 

B Pl 

B Pl 

900 

bP, 

942 

933-5 

95(1 


Silly 


174 

i 3Mi 

M 71 

till. 

J5h 

9 210 

MKJ7 ; 

.1 

eh, i J 

147 

M 

*4 5 #14 a 

dhy 



13 k 

3 211 

«* 

ch, 


j 

12 

n mw 

«( «!*!«« 1 

ad 


i 

1 

lA«i 

if M? 

O 31(8 | 

1 

hd 


j 

| 

i 

133 

12-937 

inm,; I 

rd 



IHI 

2 304 

o .VC j 

ar 



143 

0-172 

11 9*>2 1 

hr 

131 

0 5-13 

1 

«i non - 

ah 


j 

147 

5 72<i 

ti 12 k i 

u Pl 



807 

tmt 

U 075 | 

ab 


| 

921 

2-420 

0-495 ’ 

pb, 

93(1 

1-H29 

| 

wm , 

* « 


.j 
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4. Matings of the type AaBb x AaBb. Expected ratio of 9: 3:3; 1. 


TABLE IV 


Mating 

Phenotypes 

,t' s 

P 

Rais 








AHy 

Ahy 

aHy 

ahy 



AaHyhy x AaHyhy 0 

3(H) 

94 

93 

37 

1-520 

0-681 

E 

200-H1 

1)9-04 

99-94 

33-31 




HHy 

Hhy 

hH y 

hhy 



HhHyhy X Hhllyhy 0 

377 

135 

128 

34 

2-180 

0-539 

E 

379-12 

126-38 

126-38 

42-12 




RHy 

Rhy 

rHy 

rhy 



RrHyhy X RrHyhy 0 

66 

17 

20 

6 

1-044 

0-791 

E 

61*31 

20-44 

20-44 

6-81 




DHy 

Dhy 

dHy 

dhy 



DdHyhyxDdHyhy 0 

111 

42 

24 

14 

5-438 

0-145 

E 

107-46 

35-82 

35-82 

11-94 




CHy 

Chy 

CHy 

Chy 



CcHyhy X CcHyhy 0 

427 

136 

140 

40 

0-232 

0-972 

E 

421-31 

140-44 

140-44 

40-81 




AD 

Ad 

aD 

ad 



AaDd x AaDd' 0 

41 

19 

16 

5 

1-456 

0-697 

E 

45-60 

15-19 

15-19 

5-00 




Rl) 

Rd 

rD 

rd 



RrDdxRrDd 0 

195 

52 

69 

18 

3-119 

0*37(5 

K 

1H7-K3 

62-61 

62-61 

20-87 
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5. Mating of the type AaBbCc x aabbee. ThwreliYal ratio »f 
1 : 1 : 1 : 1 : 1 : 1 : 1 : 1 . 


TABLE V 


Mating 




iOit-niilyixit 




,Y* 

: P 

\ 

Eats 

AaDdHyhy x aaddhyhj 

() 

E 

. 

AI)Hy 

72 

IIH-25 

ADhy 

71 

AdH, 

71) 

Adhy 

02 

aDlty 

7ti 

at)hy 

Hdf{; 

adhy 

m 

3-114 

' It HR) 

' 

HhDdHyhy x hhddhyhy 

t) 

E 

HDHy 

81 

88-25 

HDhy 

85 

HdHy 

70 

Hdhy 

“ 

HDily 

81 

hDhy 

83 

hdlty 

Ah 

hdhy 

03 

ll'Hflt 

<> .id; 

RrDdHyhy x rrddhyhy 

0 

E 

RDHy 

117 

mt 

RDh, 

78 

' 

RdHy 

61! 

Rdhy 

«" 

rDHy 

78 

rl)by 

71! 

rtlHy 

W 

rdhy 

fth 

i m 

« 733: 

HhRrHyhyxhhrrhyhy 

0 

E 

HRHy 

711 

nm 

IIRhy 

80 

HrHy 

HI 

Hrhy 

78 

hRII, 

84 

hRhy 

87 

hrlly 

hr)» 

M 

0 07! 

(i J« 

AaHhHyhy x aahhhyhy 

0 

E 

AHHy 

88 

72-25 

AHhy 

77 

AhHy 

62 

Ahby 

87 

itHHy 

73 

alihy j ithHy 

#) | 84 

! 

ahhy 

07 

7 742 

M-357 

: 

AaHhRr x aahhrr 

0 

E 

AHR 

83 

73-25 

AHr 

83 

AhR 

67 

Ahr 

83 

aHR 

81 

uHr 

75 

a bit 

88 

ahr 

HH 

8 5 IK 

i 

| 

0-482 1 

AaHhDdxaahhdd 

0 

E 

AHD 

81 

70-128 

AHd 

75 

AhD 

65 

Ahd 

88 

aHD 

89 

»Hd 

63 

uhl> 

82 

ahd 

il-K 

» 

0-1*27; 

i 

AaRrHyhy x aarrhyhy 

0 

E 

ARH, 

60 

71-625 

ARhy 

77 

ArHy 

7!) 

Afhy 

85 

aRH, 

74 

aRhy 

70 

aril* 

82 

»rh* 

7* 

3-OH4 

j 

0-813 ; 

AaRrDd x aarrdd 

0 

E 

ARD 

68 

88-875 

ARd 

81) 

ArD 

78 

Ard 

fid 

aRD 

79 

aftd 

(12 

aril 

89 

ard 

« 

30 III 

ll'HiS! 

HhRrDdxhhrrdd 

0 

E 

HRD 

88 

68-25 

HRd 

83 

HrD 

78 

Hrd 

72 

hRD 

57 

HRd 

(1 

hrD 

87 

hrd 

AS 

12242 

o 18*4! 

Mict 












AaDdBb x aaddbb 

0 

E 

ADR 

476 

466-75 

ADb 

462 

AdB 

460 

Adb 

509 

aDB 

487 

«Db 

494 

.. 

adB 

440 

. 

adb 

427 

l"82d 

' 

0-137 | 

j 




Matings of the type AaBbCcDd x aabbccdd. Theoretical ratio of sixteen classes in equal numbers. 

TABLE VI (Rats) 


CD 
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3 
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I 


X 
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.tf 

T3 

T3 


05 

& 


O 

d 
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7. Matings of the type AaBbCcDdKe x aabbccddee. Th(*^rr*ti»v»l r«fw nf 
thirty-two classes in ctjual numbers. 


TABLE VII (Rats) 


AuHhRrDdHyhy •/. aahhrrddhyhy 


PhenotYjH'a 

..‘.J 

t) 

AHRDHy 1 

' ' ~ “” 1 

17 

AHRDhy 

23 

AHRhlty 

20 ! 

AHrDHj 

24 '• 

AhRDHy 

12 | 

AHRdhy 

Hi 

ABrdHy 

23 

AHrDhy 

17 

AhrDHy 

20 | 

AhRDhy 

Ifi 

AhRdHy 

If) 

AHrdhy 

Hi 

AhrdHy 

It 

AhRdhy 

u 

AhrDhy 

1.7 

Ahrdhy 

1H 

aHRDHy 

25 1 

aHRfthy 

24 j 

aHRdHy 

14 | 

aHrDHy 

Hi 1 

ahRDHy 

15 

aHRdhy 

15 

allrdHy 

14 

aHrDhy 

22 

ahrDHy 

15 :i 

ahRDhy 

15 i 

ahRdHy 

17 1 

aHrdhy 

10 

ahrdlly 

13 ! 

ahrDhy 

17 S 

ahRdhy 

Hi 1 

ahrdhy 

17 
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8. When albinism is involved, as in a mating of heterozygous coloured self 
(CcHh x CcHh), any animal which is albino (cc) will not show self or hooded 
though genetically present. The same is true for agouti and non-agouti. The 
theoretical ratio is 9 coloured self: 3 coloured hooded : 4 albino. 


TABLE VIII (Rats) 


Mating 

Phenotypes 

X 1 

P 



CH 

Ch 

c 



CcHh x CcHh 

0 

358 

127 

162 




E 


121-31 

101-75 





CA 

Ca 

c 



CcAa x CcAa 

0 

302 

123 

102 

0-034 



E 

303-04 

121-31 

101*75 




9. Matings of rats heterozygous for colour, agouti, and self (CcAaHh x 
CcAaHh) will give a theoretical ratio of 27 coloured, agouti, self: 9 coloured, 
agouti, hooded : 9 coloured, non-agouti, self: 3 coloured, non-agouti, hooded: 16 
albino. 

TABLE IX 


Mating 

Phenotypes 

X’ 

P 



CAH 

CAh 

CaH 

Cah 

c 



CcAaHh x CcAaHh 

0 

278 

95 

79 

39 

161 

4-263 

0-375 


E 

275-00 

91-09 

91-69 

39-56 

163 




10. The theoretical ratio among progeny from matings of rats heterozygous 
for colour, agouti, and hair (GcAaH y h y x CeAaH y h y ) or heterozygous for colour, 
self and hair (CcHhH y h y x CcHhH y h y ) is 27 : 9: 9:3:12:4, 


TABLE X 


Mating 

Phenotypes 


p 



CAHy 

CAhy 

CaHy 

Cahy 

CHy 

Chy 



CcAaHyhy x CcAaHyhy 

0 

252 

78 


29 

104 

38 

0-867 



B 


81-56 



108-75 

36-25 





CHHy 

CHhy 

ChHy 

Chhy 

CHy 

chy 



CcHhHyhy X CcHhHyhy 

O 

203 

93 


26 

114 

43 


0-710 


E 

270-42 




120-10 





Blometrika xxxi 


5 


















66 Observed and Theoretical Ratios in Mml&an Inheritance 

11. A mating of heterozygous coloured, agouti, self, haired rats 
(CcAaHhHyhy x CcAaHhH,h y ) would give a theoretical ratio of HI minimal, 
agouti, self, haired : 27 coloured, agouti, self, hairless: 27 coloured, .urn agouti, 
self, haired; 27 coloured, agouti, hooded, haired: 9 coloured, agouti, hooded, 
hairless: 9 coloured, non-agouti, self, hairless: 9 coloured, non-agouti, lmod.nl, 
haired: 3 coloured, non-agouti, hooded, hairless: 48 albino, haired: 1U albino, 
hairless. 

TABLB XI 

GcAaHhHyhy x CcAaHhHyhy 

K 

183-,'i 
01-2 
m-2 

fU -2 
2 . 1-1 
2.1-t * 

20-1 
IVH 
108-8 
3 ( 1-2 

7-1112 
0-574 

Among the sixty-five ratios given in Tables 1 XI, five depart significantly 
from the theoretical expectation. D/E for the ratio obtained from Hh k hh is 
4-90, for Ddxdd, 3-92 and for P 2 p 2 xp a p a , 3-28 (Table I). For Hh H y h r x 
hhh y h y , P is 0-027, and for Hh Dd x hhdd, P is 0-005 (Table III). Four of these 
five ratios involve the genes for hooded and dilution. Wherever these genes appear 
in hackcrosses, a deficiency in the phenotypes showing these genes always occurs. 

Among thirteen crosses giving a theoretical ratio of 1 : 1, ton have t he roemsivo 
class smaller than the expected, but only three significantly ho. The sum of all 
these monohybrid backorosses is 9588 dominant#: 9153 ropefwivm IRE * 4-71. 
For the crosses of the type Aa x Aa and which give a theoret ical ratio of 3 : l, a 
total of 5070 dominants: 1677 recessivcs was obtained. In thin cane !i E a 0-407, 
which is a very close fit. 

For the ratios with four classes expected in equal number, P * (HH17B, and 
for the crosses giving a theoretical ratio of 0; 3: 3:1, P * 0-804. For ration 
with eight equal classes, P = 0-0014, and for sixteen classes, P ~ 0-0278. 

When the differences between the observed and theoretical classes are all 
(or nearly all) in the same direction, the larger the number of such divergent 
cases which are added together the greater will be the significance of the departure 
from the theoretical expectation. 


Phenotypes 

0 

CAHHy 

1KU 

CAHhy 

60 

CAhHy 

(S3 

CaHHy 

50 

GAhhy 

18 

CaHhy 

24 

GahHy 

29 

Cahhy 

5 

cHy 

104 

Chy 

38 


A ' 2 

/' 



NOTE ON THE PRECEDING ANALYSIS OF 
MENDELIAN SEGREGATIONS 

By J. B. S. HALDANE, F.R.8. 


The data summarized in Table VII are, I think, unique. However the authors’ 
analysis of them can, I believe, be slightly improved. There are thirty-two 
classes whose expectation is equal, giving thirty-one degrees of freedom. Now 
each of these can be specified, and the appropriate value of y 2 calculated. This 
is done in my Table I. The first five degrees of freedom are the segregations for 
single pairs of genes. Thus there were 277 A and 274 a rats. Hence 


551 


The next ton correspond to potential linkages. Thus the degree denoted as AH 
is due to the dichotomy of the total into 281 AH and ah rats, and 270 Ah 
and all, giving y 2 = 0-2196. Ten more degrees are obtained by con¬ 
sidering the associations of gene pairs three at a time. Thus there were 
270 (AHR + Ahr + aHr+ahR) rats and 281 (aHR+AhR + AHr + ahr), 
giving y 2 = 0-2100. This degree is denoted by AHR, Five degrees are obtained 
by considering four pairs at a time. Thus the degree AHRD is given by 
274 (AHRD + AHrd + AhRd + AhrD + aHRd + allrD + ahRD + ahrd) and 
277 (aHRD-|-AhRD + AHrD + AHRd + Ahrd-|-aHrd-f ahRd-f ahrD) rats, 
giving y 2 = 0-0103. Finally the degree AHRDHy is given by 

278 (AHRDHy + AHRdhy + AHrDhy +... + Ahrdhy +...) 
and 273 (aHRDHy +..'. + ahrDHy +... + ahrdhy), giving y 2 - 0-0454. 

In each ease the principle is the same. We consider the genes 2, 3, 4 or 5 at a 
time, and divide the rats into two groups, one containing an even number of 
dominant genes, the other an odd. 

Biologically the first five degrees of freedom represent differences of viability 
between dominants and recessives, 

The next ten could represent linkages. However, as there is good reason to 
think that the five genes concerned are in different chromosomes, deviations 
greater than could be accounted for by sampling would probably be due to the 
fact that the effects of the different genes on viability were not additive. For 
example, Roberts, Dawson and Madden’s Table I leaves little doubt that h and 
d have an adverse effect on viability, presumably before birth. We further notice 
that the degree of freedom HD lias a large, though not significant, y 2 , due to an 
excess of HD and hd rats. The actual numbers are 168 HD, 137 Hd, 126 hD, 
121 hd. If the viabilities were as 1: 1 -a: 1 —(}: 1 -a-f) we should expect, on 
the basis of the single-factor segregations, to find 161-25 HD, 143-75 Hd, 
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TABLB I 


Analysis of X* Roberts et tiL, I able \ It 



.__. . 


' " 1 


Degree of freedom 

Difference 

■V 

A’ 1 

V 

A 

+ 3 

+0-1278 

0-0103 I 

0-90 jj 

/V 

H 

+ 59 

+ 2-5135 

0-3172 ! 

04)12 

R 

+ r> 

+0-2130 

0-0454 1 

0 83 \ 

D 

+ 35 

+ 1-41)11 

2-2232 \ 

0-14 1 

Hy 

AH 

- 5 

-0-2130 

0-0454 i 

"83 : 

+ 11 

+ 0-408(1 

0-2118:1 j 

II 04 

AR 

-11 

—0-4080 

0-2190 | 

0(44 \ 

AD 

-13 

- 0-5538 

IK9817 

n-fta ] 

AHy 

+ 27 

+1-1502 

1-3158 

0-25 | 

HR 

4- 1 

4 0-0420 

0(8)18 

(4 547 

HD 

+27 

+ 1-1502 

1-3158 

025 

HHy 

+ 7 

+ 0-2982 

0-0889 

0 77 

RD 

- 3 

-0-1278 

0-1)163 

MW 

RHy 

+ 5 

+ 0-2130 

0-0454 

0 83 

DHy 

- 5 

-0-2130 

00454 

tl-a3 

AHR 

-11 

-0-468(1 

0-2180 

004 

AHD 

-25 

-1-0050 

1-1343 

0-29 

AHHy 

+ 10 

+ 0-H094 

0-0552 

<4-42 

ARD 

-23 

-0-9798 

0-9419 

0-33 

ARHy 

-31 

-1-3200 

1-7459 

0151 : 

ADHy 

- 9 

-0-3834 

0*1470 

070 

HRD 

37 

+ 1-5782 

2-4064 

(M2 \ 

HRHy 

-15 

+ 0-6290 

0-4083 

0 53 | 

HDHy 

-13 

-0-5538 

(1-3067 

, 1(58 

RDHy 

-31 

-1*3206 

1-7459 

! 0-19 

AHRD 

- 3 

-0-1278 

0-0163 

IHK> 

ARDHy 

-31 

-1-3206 

1-7459 

0-19 

AHDHy 

-13 

-0-5538 

0-3(817 

1 lt-58 

AHRHy 

-23 

-0-9798 

04)419 

0-33 

HRDHy 

+ 17 

+0-7242 

0-5245 

| (l 47 i 

AHRDHy 

+ 5 

+ 0-2130 

0-0454 

j U-M3 1 

Total 31 

- 7 

-0-2130 

25-639 

j “““ 

J 0-70 


131-75 hD, 114-25 hd, which would give %* * 0 for the degree of fmxiuiu HD. 
If the viabilities were as 1: 1 - a : 1 ~ji : (1 --a) (1 -/i), which is jKirhaps n more 
plausible hypothesis, we should expect 1(12-11) HD, 143-kI Hd, 131-Ki hD, 
114-19 hd, giving y* - 0*001, So the two hypotheses are indwtinguwhable 
except in enormous samples, 

The remaining degrees of freedom represent similar biological possibilities 
as to the non-additive character of the effects of various genes on differential 
viability, Fisher (1925), who was the first to point out the method here employed 
for the analysis of y 2 , states that these degrees have “no simple biological mean¬ 
ing”, However, the positive value for the degree HRD could be mainly due, for 
example, to the fact that hRD rats, of which there were only 58 m against an 
expectation of 68-875, are more inviable than was to be expected from the effects 
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of the genes one at a time, which would give an expectation of 66, Such inter¬ 
actions have, of course, been observed by Gonsalez (1923), Timofeeff-Ressovsky 
(1934) and others. 

In my Table I the values of P in the fifth column are read off from those of y 
in the third by means of a table of the probability integral, except the last entry, 
which is calculated by Wilson and Hilferty’s (1931) theorem. There are no 
suspiciously high values of P, and only one (for the degree H) which is significantly 
low. Actually a glance at Roberts, Dawson and Madden’s Table I shows that had 
their Table VII been based on a larger sample the values for H and D would 
almost certainly have been significantly low. 

In the earlier tables some rats are included which do not figure in Tables VII 
and XL Hence for a full discussion of the data each table would have to be 
analysed separately. 

We can now criticize Roberts, Dawson and Madden’s final analysis of the 
data. They point out that the total y 2 for segregations involving eight equal 
classes, i.e. three genes at a time, is unduly high. We can see why this is so. 
Supposing we extracted from Table VII the data on segregations of three genes 
at a time we should obtain ten tables with seventy degrees of freedom. If these 
were analysed into their components we should find that we had counted the ten 
degrees of which AHR is typical once each, those of which AH is typical three 
times each, and those of which A is typical (i.e. single gene-pair segregations) 
six times each. We have thus given a quite undue weight to those degrees of 
freedom which actually show the greatest deviations. Actually each deter¬ 
mination of y 2 by the authors involves one degree of freedom not dealt with in the 
earlier tables, while the remainder refer to segregations already considered, and 
often on larger samples, in their earlier tables. 

Thus, to take an example, the rat mating AaHhDd x aahhdd of Table V 
has a y 2 of 11 -31 for seven degrees of freedom. But the only new information in 
Table V relates to the difference between AHD +Ahd + aHd + ahD and 
aHD + AhD +AHd + ahd, giving y 2 = 1-941 for one degree of freedom. All the 
other information Is already given in Table III, along with some more, for 
example, concerning rats segregating for A and D, hut not for H. The correct 
method of collating the data is therefore to add up the values of y 2 for the last 
degree of freedom in each case. 

The expression for the last degree of freedom in a P 2 mating such as 

AaDdxAaDd can easily he shown to be, This 

J 9(AD + Ad + aD-f ad) 

expression has fi x - and a nearly normal distribution. Similar expressions 
can be written down for larger P 2 ’a. Where the mating involves epistacy the 
formulae for the last degree of freedom are similar. For example, the first mating 
of Table VIII involves two degrees of freedom, the degree C for the segregation 
of C and c rats and the degree H(C) for the segregation of H and h among G rats. 



70 Note on the preceding amlym of MewhUnn m grr-yntitm 


TAHLK II 

Vtihm of ,\' 2 for single degrees nfjfrmiow. oil rfnUi 


Animal 

Degree of freedom j 

j 

A 4 

Rat 

A 

2*4511 

II 

H 

10-blti 


R 

<1*353 


G 

U-HH 

II 

Hy 

(1*427 

tl 

D 

(i*if!7 

Mouse 

A 

t-tiRti 

»» 

Pi 

t*(ILM 

ii 

B 

<HI70 

,, 

P 2 

(J*|Kll 

II 

Y 

(1*032 

Rabbit 

E 

(1*177 

♦ ♦ 

A 

f)*KK!l 

Rat 

AA 

1*138 

H 

HH 

1*3117 

»t 

RR 

(1*010 

♦ 1 

CC 

(1*177 

II 

HyHy 

IHKKI 

>1 

BD 

0*020 

Mouse 

P,P 2 

1*255 

Rabbit 

EE 

1*420 

1 factor 

21 

31*012 

Rat 

AHy 

(1*029 

II 

HHy 

0*027 

H 

RHy 

0*061 

»l 

DHy 

0*323 

II 

GHy 

0*296 

II 

AD 

0*175 

II 

HD 

2*532 

II 

RD 

1*471 

II 

AR 

0*001 

II 

HR 

0*387 

>1 

AH 

0*778 

Mouse 

AP, 

4*252 

II 

AB 

0*130 

II 

BP, 

0*130 , 

Rat 

A^y, 

1*308 ' 

II 

H i Hy s 

1*852 

II 

R t Hy s 

0*081 

II 

D 2 Hy,, 

0*88fi 

ll 

C a Hy„ 

0*025 

M 

MV 

0*657 

ll 


0*012 


Animal ! 

tk*|itrr id freedom 

X* 

2 faclram 

21 

10 572 

Rat 

ADHy 

i 

»» Ux 

,, 

Hlitly 

o 20 1 


Rimy 

i: in 

,, 

HRltj 

«< Ml 


AHHy 

M 692 


AHR 

*< 2iH 


Aim 

t sm 

M 

A RHy 

2^31 

II 

ARt) 

i j:h 

♦ I 

llRD 

2 m;w 

Mmtw* 

ADR 

2 Thu ; 

! 

3 fitel«r* 

II 

! 

13 303 ; 

Hat 

ARDMy 

j 

1 7 Ut 

i) 

HRDHy 

‘(525 

11 

AHDHy 

ii*3n7 , 


A H RHy 

" t.it;: * 

41 

AHR!) 

0WA 

4 factors 

! | 

8 ; 3 335 

*" 


Rat 

AHRDHy 

it ms 

Rat 

. ** 

HjRVj 

•? 362 ' 

f » 

A;(C,i 

Oram 1 

I* 

AjlljtGi 

2 530 ; 

H 

AjHy s iC.i 

0 18 M 

ll 

}t,liyjci 

l ?mi 


A ,t 1 j H y ,j( ■ i 

•im 

Epislftcy 

6 

l 

8 180 

Grand total 

65 

' 

75-35 

~r 

. .. - ’ .. 

- - ■ • 
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This would be affected by linkage. Its y 2 


4(CH~3Ch) 2 


Similarly, for Table 


!)(CH + Ch +c)' 

IX the last degree of freedom involving the association of A and H among G 

4(CAH - 3GAH - 3CaH + ilCah) 2 


animals is 


r 


27(CAH + CAh + GaH + Cah + c) ‘ 

The last component degrees of the sixty-five segregations of Tables I-XI 
are given in my Table II. Whereas Roberts, Dawson and Madden, for the reason 
given above, find a significantly large total y 2 , I do not. The probability of 
y 2 = 75-44 for sixty-five degrees of freedom is P = 0-18. The whole excess is due 
to the single-factor segregations, and of these, one gives a significantly large y 2 , 
while another probably does so. Of the forty-four degrees of freedom involving 
more than one factor the largest is 4-252, and one value as large as this is to be 
expected. The order of the degrees is taken from Roberts, Dawson and Madden, 
If we include the eight eases involving epistacy with the others involving the 
same number of factors (gene pairs) we find the results summarized in my 
Table III. 

TABLE III 


Summary of y* values 


Number of factors 

Number of degrees 

Total y 1 

segregating 

of freedom 

1 

21 

31-912 

2 

23 

1(5-942 

3 

14 

20-635 

4 

fi 

5-913 

5 

1 

0-045 

Total 

(55 

75-36 


It will be seen that, except for the single-factor ratios, there is no evidence of 
deviation from Mendelian expectations, and on the whole the fit is very much 
better than would appear from tho analysis of Roberts, Dawson and Madden, 
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THE USE OF STATISTICAL METHODS IN THE INVESTI¬ 
GATION OF PROBLEMS OF GLASSIFICATION IN 
ANTHROPOLOGY 


PART I, THE GENERAL NATURE OF THE MATERIAL AND THE FORM 
OF INTRAKACIAL DISTRIBUTIONS OF METRICAL CHAIIAITKUS 


By G. M. MORAN'T 


Every naturalist who lias had the misfortune to undertake (lie d<wrifi(i«*ft of » of 
highly varying organisms, him enemmtered eases (1 ajH'ak alter ex|»«r»*nr*«) |*twi*4v like 
that, of man; and if of a euutimw iliH|M»itwm, he wilt end hy muting all the forms wMi 
graduate into each other, under a single aperies; for lie will nay to himself that le> }«w in 
right to give names to objects which lie cannot define. 

t’HAfii.KH Darwin. The !kmn («/ .V«n, ln?l 


1, INTRODUCTION 

The main aim of physical anthropology in to unravel the wmn# of hmtiau 
evolution, and it may be taken for granted to-day that the proffer study of the 
natural history of man is concerned essentially with the mode and the path of 
his descent, The general method employed is to compare the physical dmreotors 
of suitably chosen groups of individuals, and to discover the interrelationship* 
of these groups from an interpretation of the differences found between them. 
Difficulties are encountered at the outset owing to the nature of the wntimtum 
of which the component parts have to be compared. The groups which arc most 
suitable for the purpose in view can easily lie recognized in a general way. but it, 
is difficult to define them with precision. There is a lamentable lack of agreement 
among anthropologists to-day regarding the way in which population* suitable 
for the purpose of investigating moss descent can best he discriminated. 1 la ru in 
stressed the difficulty of the problem, and it would lie idle to hope that any 
simple solution of it has been overlooked. 

This paper provides a discussion of the statistical approach to amhrupo- 
logical taxonomy. The general thesis maintained in it is that the comparatively 
new method can be used in a systematic way to discriminate suitable group*, to 
reveal their interrelationships, and hence to disclose the course of racial history 
in so far as adequate evidence is available. 

It is now more than forty years since Karl Pearson first applied to racial 
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material the statistical methods which he established and greatly extended.* 
Recognition of the value of his procedure was slowly won, but the methods are 
almost universally accepted to-day. It may even be said now that the advantages 
of their use in physical anthropology are generally taken for granted. There is no 
longer a need to emphasize the value of measurements, or to point out repeatedly 
the futility of attempting to derive any useful conclusions from evidence as 
scanty as that on which many earlier theories were based. 

Karl Pearson’s teaching in this field has been most widely accepted in so far 
as methods of reducing group data are concerned, but it is one thing to describe 
and another to interpret in anthropological terms the situation observed. With 
regard to such interpretation, he indicated the lines along which he expected that 
progress would be most profitable, but he did not codify a system or lay down 
any rigid rules for the guidance of those who wished to follow him. His ideas 
with regard to this matter could only be grasped by observation of the ways in 
which he treated particular sets of material. Meanwhile, different anthropo¬ 
logists were drawing deductions from statistical evidence in a variety of ways, 
some of which are entirely at variance with wluit came to be known as “ bio¬ 
metric” practice, and many of which are irreconcilable inter se. The position 
became chaotic, and it is still in this state. 

The view advocated in this paper is that the ways in which statistical 
methods may be used to supply valid and useful anthropological conclusions can 
only be determined from wide application of these methods to suitable material. 
No a priori considerations are likely to be of much help here: the nature of the 
situation has to be examined thoroughly before it is possible to decide on the 
best ways of treating it to give results of the kind required. The short history of 
the subject bears clear witness to the fact that the empirical test is always the 
crucial one. The nature of anthropological groups has now been sufficiently 
explored, and certain methods of interpretation have now been sufficiently ap¬ 
plied, to make possible a just assessment of the value.of a particular procedure 
due primarily to Karl Pearson. He repeatedly stressed the need for adapting 
statistical theory to practice, and we may follow him in deciding that the value 
of any new descriptive methods, or modes of interpretation, suggested must he 
judged from a sufficiently wide application of them. 

The statistical nature of anthropological groups will be discussed first, and 
certain simple but important generalizations which are often ignored by 

* The first general treatment of the topic was given in a paper by Cicely 1). Fawcett and others 
(1902) which Karl Pearson edited and arranged. He had previously applied statistical methods to 
anthropological material in The Chances of Death and other Studies in Evolution (1897) and in ft 
series of papers in the Proceedings and Transactions of the Royal Society of London, Quetelet 
(chiefly in 1835 and 1871), Galton (in various publications), Stieda (1883) and Witt (1879) hftd 
previously used statistical methods in discussing anthropological problems, but they were either 
not concerned with problems of racial differentiation or else they considered such problems only 
in a cursory way. 
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anthropologists can be formulated at this stage. In the second place, different 
methods of reduction will be considered, and, lastly, the anthropological con¬ 
clusions which may legitimately be derived from the data reduced in that way 
will be discussed. In this Port I of the whole paper topics discussed mw-wii the 
selection of anthropological samples (§ 2). general eoimidmftotw lcgarding the 
treatment of the samples chosen (§ 11), and the forms ol intrararial distributions 

(§4). 

2. The selection of anthroi’OLouH'ai. samm.bs 

Anthropology has been defined as the study of groups, and this definition is 
appreciated at once by the statistician, since bis methods are essentially designed 
for the treatment of group data. In a consideration of anthropological material 
the nomenclature of the statistician may he used with advantage; the nature ol 
the processes of description and analysis is thereby made dearer and certain 
ambiguous biological terms are avoided. 

A population is defined to be any assemblage of individuals considered and 
treated as a single group. It may be large or small, ami there may be little or 
much justification for treating it as a single group. The term is a general, but 
unambiguous, one which can be conveniently used in practice. A ntmpk is 
made up by a number of individuals selected from a population, amt the select ion 
is said to be random, when it is believed that the population «« a. whole is fairly 
represented, so that there is no bias favouring any special sect ion of it, A samples 
of individuals may be said to form a series, and these two terms can often He 
interchanged without loss of perspicuity. 

The general method of the physical anthropologist in dealing with new material 
is: (a) to select a sample at random from a particular population ol' a suitable 
kind, (6) to describe the characters of the individuals comprising the sample, 
(c) to infer from these observations, with greater or less accuracy according to 
the size of the sample, certain characteristics of the population sampled, (d) to 
make comparisons between the evidence so obtained and that available for 
other populations described in the same way, and (e) to deduce from these 
comparisons the biological relationships of the new population. It in nwwtary 
to make clear distinctions between these successive processes, and failure to do 
so has sometimes led to confusion, 

The description in statistical terms of the first process, i.e. that,of sampling, 
is perfectly precise providing that there is some means of ensuring that the 
samples are taken at random, but in applying the process to his material the 
anthropologist is faced with another difficulty, How is he to distinguish the 
populations from which suitable samples may be taken? In general very little 
consideration has been given to this question. In comparisons made between 
samples selected in different ways and representing different kinds of populations, 
differences between the kinds of sources from which they were derived have 
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generally been ignored. But in fact such differences are of vital importance, aind 
half the difficulty of the whole process of the statistical analysis of anthropo¬ 
logical material is overcome if an effective way of dealing with the problem of 
selecting suitable samples can he devised. The contention made here is that 
owing to diversity in the modes of selection of the samples commonly used by 
anthropologists, there is often little justification for applying statistical methods 
to them in a rigid way. 

For the anthropologist the ideal population would be one made up by a 
number of individuals having a common descent. In the most favourable 
circumstances such communities cannot be distinguished at all exactly, however, 
and in the initial stage of his enquiry the anthropologist is generally quite unable 
to delimit with any approach to precision groups defined in such an abstract 
way. In practice far less stringent conditions have to he accepted. It may be 
said that a community made up by a number of individuals whose ancestors—or 
the majority of them, at least—are believed to have intermarried for a consider¬ 
able number of generations will he a suitable one to accept as a unit group from 
which a sample may be taken. Such groups have to he chosen as carefully as 
possible, having regard to any relevant evidence available. But populations of 
very different sizes will satisfy the condition stated. The total population of a 
province and also a small parochial community forming a special part of this 
total might be considered, and it is possible that these are different in nature and 
that different conclusions would he derived from them. A further condition, 
then, which should he satisfied wherever possible, is that the population con¬ 
sidered is a large one of the regional rather than of the parochial kind. Experience, 
discussed below, has shown that large communities are always more suitable for 
consideration than small ones when the purpose in view is the analysis of more 
remote origins. But if the only sample which can be obtained does represent a 
population of the parochial kind—as may he the case if a series of skeletons from 
a single cemetery is the only material available-then it should he fully recognized 
that this may have been, a special part of a larger population which might more 
profitably bo considered as a unit group, and allowance should be made for the 
peculiarity of the source. 

Having selected a number of samples from what appear to be suitable 
populations, the anthropologist may compare their statistical features, such as 
the forms of distributions, measures of variation and correlations they supply. 
If the majority of the samples are found to possess certain particular cha¬ 
racteristics, while departures from the rule are occasionally met, then it may be 
legitimate to conclude that the peculiarity of the exceptional cases is due to the 
fact that the populations they represent are unsuitable for the purpose in view. 
The empirical investigation of samples known to represent unsuitable popula¬ 
tions can aid examination of the matter. The practical procedure which appears 
to be most profitable is this: samples believed to be of the right kind are selected 
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and then certain tests, derived from experience oi the statistical characteristics 
customarily exhibited by such samples, are applied in order that abnormal ones 
may be revealed and rejected as unsuitable for further use. 

The initial process of selecting samples must be considered rather more fully. 
In practice it may he advisable to take, into account information of several 
kinds. The most important factor is usually geographical position, hut the 
separation of series representing different social classes may he desirable, and 
archaeological, historical or linguistic evidence may suggest that the parti¬ 
tioning should be carried out in a particular way. In dealing with skeletal 
material the time factor may be an important one to consider; it may Ik; well to 
treat a pagan and a Christian Anglo-Saxon series separately, for example, and 
not to assume that the two represent indistinguishable populations. 'Dm general 
rule is that the total series for which data, are available should he split up in a 
natural way into as many subseries representing distinct populations of a, 
considerable size as possible, providing that each mil merles select ed is large 
enough to give comparisons of value. The way in which this can liest he dune 
depends on circumstances which are peculiar to each set of material. The 
anthropologist is thus obliged to plan his survey of a particular region with 
evidence of several different kinds in view. The majority of these tin not relate to 
physical characters, but he has a perfect right to consider, its relevant to his 
biological enquiry, any evidence which gives some indication of the subgroups of 
the total population which are such that intermarriage normally takes place, or 
took place, within rather than between them, 

The procedure described is necessarily rather vague and of an arbitrary 
nature: any subdivision adopted initially is experimental. In practice choice is 
largely controlled by the amount of information available, Suppose, for 
example, that the total sample for which measurements are available represents 
people coming from all parts of a particular country. If there are enough 
individuals they may be divided into subsamplea on a regional basis, or the 
population of oach region may be thought of as subdivided into county groups, 
say, and each of these may be further subdivided into parochial nommumtitw, 
which may bo split up into small groups of closely interrelated people. A 
hierarchy of groups within groups can thus he imagined, but in practice it w ill 
not be possible to carry the process of subdivision beyond a certain stage, a« 
further dissection would lead to subsamplea too small to yield etunpnrijwms of 
value. There can be no assurance that the divisions based on geographies! or 
other considerations will be the most effective for the purpose in view, but some 
such divisions have to he adopted in order to disclose the nature of the total 
population. 

At the outset of his survey of new material relating to the population of a 
particular region, the anthropologist thus requires a knowledge of the size of the 
smallest series which can be used profitably for the purpose of the classification 
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in view. This minimum requirement cannot be determined from any a priori 
considerations: the point 1ms to he settled after an experience has been gained 
from the comparison of series of different sizes. The conclusions reached in this 
way are discussed below. 

However ample his material is, the anthropologist engaged in investigating 
remoter origins and relationships should choose his unit groups, ivhenmr 
possible, in such a way that they relate to populations of a considerable size. 
Family and parochial groups should certainly be considered too small. The need 
for restricting comparisons to larger groups than these if possible is evident on 
account of both a priori and a posteriori considerations, but it is frequently 
overlooked.* It should be anticipated that series representing small com¬ 
munities are likely to be biased, not random, samples of regional populations. 
The existence of local variants of a widespread population derived from a single 
source has to be recognized, and hence special precautions have to bo taken in 
drawing deductions from samples representing local groups. In conformity with 
these expectations, it is commonly found in treating a total group of the 
national kind that samples representing small subgroups of it tend to differ 
more than samples representing large subgroups. It is customary to find in such 
a case that up to a certain point there is greater uniformity in the statistical 
attributes of the samples—quite apart from fluctuations due to random 
sampling—according as the groups represented by the samples become larger. 
Greater diversity has to be expected when smaller subgroups are compared. If 
the anthropologist concerned with the broader taxonomic problems of his 
subject treats groups of the family or parochial kind, lie will thus be in danger 
of losing his way among the trees when he should be taking a bird’s-eye view of 
the wood as a whole. 

It is unlikely that, an example of the statistical conception of a perfectly 
homogeneous population is ever encountered in dealing with anthropological 
material. The kind of continuum which has to be analysed by making comparisons 
between its component parts is of a peculiar nature. It would lie an advantage 
if the subsections of it considered for the purpose of racial classification were 
always populations of a considerable size, and for many sots of anthropometric 

* The writer had overlooked it at one time, and ho was accordingly admonished in a letter from 
Karl Pearson dated 17 July 1024, in which the following passage ooours: 

“May I give you an analogy? In a little Yorkshire village there are two manors—two small 
squires, little better than yeomen—and they kept on intermarrying their family members. About 
one-third of the churchyard contains, for at least two centuries, the graves of these folk, You could 
get at least 30, and possibly 100, crania from that third of the ehurobyard which would differ very 
sensibly from the skulls in the remainder of the yard. It would arise solely from the fact that we arc 
dealing with an inbred population, which possessed certain characters which raised it above, or at 
least differentiated it from, the remaining population. I don’t suppose there was any racial 
difference between these little squires and their neighbours, they were all ultimately of Danish 
descent. But simply one or two interbreeding families were buried in one earth. Now my analogy 
has for its bearing the danger of picking out 20 crania and saying these differ from the rest.” 
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data this condition is satisfied. Not infrequently, however, and particularly in 
the case of excavated skeletal material, the only samples available for a particular 

regional group are of the parochial kind. 

The Anglo-iSaxon skeletons preserved in museums net unity * mm* irmn. a 
number of scattered cemeteries, and there is no long series from » single 'cmefciy. 
The fact that they were dispersed should he considered a real advantage when 
the object in view is to determine the characters of Anglo Saxons in general, or 
of the larger subsections of this total population. If all the sprimeim had cone 
from a single cemetery it would have been difficult not to assume ilia* the senes, 
particularly if it were a large one, could lie accepted Janie ik minor as 
representing the total Anglo-Saxon population, and to draw conclusions irmn a 
comparative study on this assumption. In fact there is good reason to suspect 
that the physical characters of the, people forming any small Anglo Saxon 
community differed appreciably in their averages from those of any large section 
of the total population. 

In practice it is seldom possible to exercise much choice in selecting muuphw. 
In many instances the only material available representing a past imputation of 
the national kind consists of one or more series of skeletons from one or two 
cemeteries. The anthropologist cannot he expected to neglect all wriw which are 
not ideally suitable, (nit he should remember that several he is obliged to use are 
not ideal for the purpose in view. Allowance has to be made for know u differences 
between the samples used dependent on the diverse ways in which they were 
selected. 


3. The statistical treatment ok AtmiRimn.omrAL samtlkh 

In treating his problems of classification the nnUm>jM»U>g»Ht starts by 
selecting series of individuals representing different populat ions which appar to 
be more or less suitable for the purpose. Data have been recorded for large 
numbers of such samples relating to living peoples in different parts of the world 
and to extinct peoples represented by series of skeletons. It is reeuguiwd that 
these samples are taken from groups of different kinds, and t hat allowance will 
have to be made for this fact in comparing them. Apart frmn thin diversity 
which is appreciated, there is no guarantee that all the samples am suitable fur 
the purpose of examining group relationships. Some may he entirely unsuitable 
because they represent populations of exceptional kinds, and t heir jKu-ulmrity in 
origin has to be detected by examination of the samples themselves. For 
example, a certain number may be found to have peculiar characteristics which 
suggest that they should not be used, while the majority conform to a particular 
type. In practice it is necessary to appeal to experience derived from the treat¬ 
ment of a large number of series in order to justify the use of certain testa 
employed to discriminate between suitable and unsuitable material. 

A question which might be asked is; What is the nature of anthropological 
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samples selected in the way described?—what features are common to all of 
them and in what ways do they differ? The statistician—who is accustomed to 
thinking of problems in terms of populations and samples—points out at once 
that, though this question may be of the right kind, yet it is not put in the right 
way. He reminds the anthropologist that his ultimate concern is with popula¬ 
tions, and that the samples are really of interest only in so far as they supply 
information regarding populations. A moment’s consideration shows that this 
is, or should be, the view of the anthropologist. If he collects information 
relating to 100 Greenland Eskimos, say, it is only in order to arrive at generaliza¬ 
tions regarding either all Greenland Eskimos, or else some section of them which 
is still a population many times larger than the sample observed. Hence the 
question should be: What generalizations regarding populations can be deduced 
from samples selected in the way described?—what features are common to all 
the populations and in what ways do they differ? 

The fact that his ultimate concern is with populations and not with samples 
of them is not likely to be entirely overlooked by the anthropologist, but there is 
a real danger that he may assume in particular instances that the characteristics 
of samples are precisely the same, as those of the populations they represent. 
Statistical treatment has a great advantage in this connexion, since it keeps the 
distinction referred to continually in view and provides a systematic and—as far 
as circumstances permit—precise method of reaching the generalizations required. 

In this paper the only kind of information regarding samples which will be 
considered is that provided by measurements, whether of series of living people 
or skeletons. The general problem can be viewed in the same way if non- 
metrical characters are dealt, with, but a different statistical treatment is re¬ 
quired for them. All the individuals referred to will be supposed adult, and for 
such the measurements can he supposed, in general, to be unaffected by the age 
of the individual. The data for males and females have to be considered sepa¬ 
rately, but the generalizations reached are the same for the two sexes. Data 
relating to numerous series are available for a particular set of characters 
commonly recorded by anthropologists: nearly all of these measurements 
concern either absolute size (chords and arcs) or shape (indices and angles). In 
the ease of the skull, for example—the data for it being more abundant than those 
for all other bones of the skeleton put together—the measurements are designed 
to give a description of the size and shape of the skeleton of the head considered 
as a whole and of all its principal parts. The majority of the anthropometric data 
for living people commonly recorded provide indirect measures of the size and 
proportions of different parts of the skeleton. 

With the object of obtaining information regarding the distribution of a 
particular character in the population represented, the following features of the 
distribution provided by a sample will be considered: 

(i) its form, 
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(ii) its scatter, that is to say the variation exhibited by the re»4iti«« fnr the 
individuals composing the sample, 

(iii) its central tendency, that is to say the value of the character {or moots) 
about which the readings can most conveniently he considered to 1«* scattered. 

These three features of the distributions are first considered in the ease rtf 
characters dealt with singly. It is also necessary to gain information regarding 
the ways in which different characters arc associated in individual*, and hence 
it is necessary to examine: 

(iv) the correlations of different pairs or groups of characters in individuals. 

A sample of the kind so far considered is made tip hv a number of individuals 

believed to belong to a population such that its memlsTs have been chiefly 
intermarrying with one another for a number of generations. This is called an 
intramid sample—a rather unsatisfactory term to use at this stage* since the 
concept of race has yet to he defined. An examination of the forms of the 
distributions and of their variabilities in the case of metrical characters con¬ 
sidered singly, and for a considerable number of intraracial samples, shows that 
the only comparisons of them likely to give results of any nuthropttloginri interest 
must be of a certain kind defined in later sections of thin pnjw, NutiHtic.nl 
treatments of other kinds are seen to bo unprofitable. The same evidence makes 
it clear that the most appropriate measure, of the, centra! tendency of any mw of 
the distributions to use is the arithmetic mean, or average. 

Attention is thus focused on mean measurements, and different ways of 
treating them have to be considered, The conception is introduced of samples for 
which the units are not actual individuals—as for in! raraeial samples but* 
abstract beings such that each has metrical characters equal to the averages for 
the particular group which it represents, This idea of Vhomme. mmjm is not new 
in anthropology: it was clearly enunciated more than UK) yearn ago by Quetelet 
(1835). An assemblage made up by a number of homnm mayrm representing 
different populations is called an interracial sample, and in the caw of a, pa rticular 
character the mean measurements for a number of samples representing different 
populations will provide an interracial distribution. 

In order to gain a more complete knowledge of the statistical nature of 
anthropological samples, required for the purpose of determining the ways in 
which they can best be treated in practice, it is necessary to take eognistanec of 
other features of them in addition to those listed above. Interracial considera¬ 
tions are involved in examining: 

(v) the ways in which the averages for different characters distinguish 
certain sets of populations, 

(vi) the forms and variabilities of interracial distributions, and 

(vii) the interracial correlations of characters. 

In the course of the investigation of these topics it is possible to obtain 
estimates of the minimum sizes of samples which are required, under different 
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conditions, in order to provide intergroup comparisons of value. One of the 
advantages of the statistical approach to the problem of classification is that it 
makes it possible to determine with some precision the least amount of evidence 
which must be available in order to yield useful conclusions. The history of 
physical anthropology bears clear witness to the fact that some control of this 
kind is a vital need. 

In the following section of this paper the forms that occur in practice of the 
intraraciai distributions of anthropometric characters are discussed. It is hoped 
that this will be followed in later parts by discussions of the other topics listed 
above—viz. other intraraciai characteristics of distributions and characteristics 
of interracial distributions—and, finally, by a discussion of methods of comparing 
samples. 

4. The forms of intraraciai. distributions 

It can be stated categorically that the distributions of measurements for the 
vast majority of samples that occur in anthropological practice tend to conform 
closely to the normal curve. Quetelet's suggestion (1871) that this is so has been 
confirmed by data relating to numerous series of living people and skeletons from 
all parts of the world. In general, the samples from a particular population tend 
to give a closer and closer approximation either to the normal or to a very similar 
form of continuous curve according m their sizes arc increased. Hence it is safe 
to infer that the characteristic type is unimodal and symmetrical. This generali¬ 
zation applies to all absolute measurements and, in spite of a theoretical quali¬ 
fication mentioned below, also to all measurements of shape (indices and angles) 
for which the matter has been adequately investigated. 

There has been considerable confusion regarding interpretation of the forms 
of frequency distributions in the discussions of this topic provided by some 
anthropologists to whom statistical conceptions were unfamiliar. In the early 
days of biometry the fallacy of drawing conclusions of certain kinds from 
peculiarities of the curves provided by samples was repeatedly stressed, but this 
warning is still occasionally ignored. The chief errors made with regard to the 
matter have boon in supposing («) that a small sample—made up by fewer than 
100 individuals, say—is capable of giving an adequate estimate of the form of 
the distribution in the population sampled, and [b) that mere inspection of the 
diagram for a sample is sufficient to reveal the information required, In this 
connexion numerous examples may be found of the fallacy of supposing that the 
features of a sample are precisely the same as those of the population from 
which the sample was taken. The general appreciation by anthropologists of the 
perfectly clear implications of a simple sampling experiment such as the one to 
be described would still save much fruitless discussion. 

The head lengths and breadths of 1314 soldiers who were natives of Lanark¬ 
shire have been given by Tocher (1924). The order in which the first 700 men are 
arranged in the table appears to have been entirely random as far as the 

Biornctrika xxxt 6 
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measurements are concerned* The cephalic indices were calculated for this 
sample and Fig. IA shows the distributions obtained by taking iwmwvr groups 
of 50 from the first (nos. 1 50) to the fourteenth funs. OAI TMO) Ksn-h one of 
these subsamples provides an estimate of the distribution ot the repludu- index 
for Lanarkshire men in general, and any one of these stihsamplw might, have 
been the only one available relating to the total population, The distributions 
in Fig. 1A clearly exhibit a great diversity of forms. They are all alike in showing 
a majority of individuals with indices between 75 and HU, and in either covering 
a continuous range or showing a few outlying values detached from the main 
body. Otherwise, there is little agreement between them. Smut* show « more or 
less gradual rise to a maximum frequency and then a gradual fall (a* tm. X), 
while others appear to have two (nos. XII and XIII) or mure (now. III and XI) 
distinct peaks. The fact that inconsistencies are found in these res)ret* makes it 
obvious that the series aro too small to provide any reliable information an to the 
existence of such features in the distribution for the. population they »H 
represent. 

Fig. lB shows the same 700 measurements treated in successive sulwurmplc* 
of 100, so that the first series (F) is the sum of the first two (I and II) in Fig I A, 
and so on. The distributions for these larger series show mure uniformity the 
maximum frequency being for the range 77 7H in four eases out of the woven, for 
example—-but they still differ very appreciably among themselves in wine 
respects, and one (VI') appeal’s to bo clearly of a hi modal form. It i» evident 
that the measurements of more than 100 individuals art* required to give any 
reliable indication of the details of the distribution of the cephalic index in the 
parent population. 

The top and the middle distributions in Fig. 1C relate to the samples made up by 
the first 350 (I") and the second 350 (II") Lanarkshire men, respectively. Both 
distributions appear to be of symmetrical form, hut the first slums an out¬ 
standing maximum frequency while the second is "flat- topped" (plalykurtic). 
The conclusion must be that it is unsafe to draw deductions regarding wune 
features of the distribution for the parent population by merely nwjwting the 
distributions q£ samples, even if they are as large m these two. Thu bottom 
diagram in Fig. 10 relates to the total sample of 700 mm considered. It apjmara 
to be slightly asymmetrical, but comparison with the corresponding areas of the 


After some number a little above 700 the order in which the individuals an* arrange* 1 m dearly 
not random. They were evidently taken in successive small grouj* *qeh that each gnmj* had a 
restricted range of head lengths, anti consequently the distribution# ot this cephalic index for 
samples of ft) (nos. 701-50, and so on)'aw very peculiar. Dr Tocher tan kindly answered my 
enqumes with regard to this matter,ind he informs me that the departure from nmdumnm* waa 
not appreciated when the measurements wore taken. It may have been due to the fact that, at mm 
depot the officers arranged the privates measured in hatches having hats of the ms tits, in the 
belie that this would help the anthropologists! Departures from a random mbcUun which may 
atteot the distributions of characters are easily overlooked. 
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Fig. 1. Distributions of the cephalic index for subsamples of a total sample of 700 men from Lanarkshire. 
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normal curve fitted to it, shows that, the divergences of the iiisbtiir.miH from their 
theoretical values on the hypothesis of normality are all *mull relative to the sizes 
of these frequencies. 

This example is sufficient to demonstrate that it is entirely unsafe to nHneb 
any significance to the peculiarities of the distributions tor small samples any 
made up by fewer than 200 individuals, say anti that mere in*pr<Hou of the 
forms of distributions for samples which would usually la* considered adequate 
for most statistical purposes is liable to mislead. The anthropologist win* deduces 
theories of racial mixture or relationship from the “ {teaks of small dh*f ribitf i*m« 
is deceiving himself by building on a statistical foundation which is uijwnnd 
owing to the inadequacy of the evidence. He is trying to gel more mfonmtt ton 
from his material than it can possibly provide with any awmmmi* *»f rortn-t iim, 
and forgetting that a sample only provides estimates of the features of the 
population represented. 

It is found in anthropological practice that distributions for small wimples 
frequently appear to he markedly skew, or to be hi- or multimodal, but »hul 
those for largo samples scarcely ever exhibit any of these prulianrieH Tlte 
writer is unable to give any exam files of a distribution, for flu? kind of sample 
now considered, which relates to more than 300 individuals and which foil*, to 
show a close approach to the form of a uniinoilnl and symmetrical curve, with 
the exception of two discussed in the appendix Mow. The omtrrenw of more 
than one mode which appears to he definitely outstanding, or of any appreciable 
degree of skewness, is apparently never exhibited by the distributions of 
characters for large samples of the kind encountered in «nt lump lingual practice 
The conclusion must bo that when such peculiarities arc found lor the diidrilm 
turns of small samples they merely demonstrate that small sample* am subject to 
large “errors” of random sampling. The “peaks” can have no fttitlirnpifogfoa) 
or genetical significance. Their unstable nature can often lm demon*! rated by 
merely grouping the individual measurements in a different way. n* tfoa may 
change the appearance of the distribution to an appreciable extent, 

If it can be demonstrated adequately that the distributions which occur in 
practice almost invariably indicate that the parental distribution* mum be 
closely similar in form to the normal curve, then this is obviously a eonrlmmm 
of great anthropological importance, since it implies that the kind of statistical 
analysis likely to be at all profitable is severely restricted owing to fin* nature of 
the material. Evidenfce of a more adequate statistical kind regarding the forms 
of distributions will now be considered. 

Values are given below of probabilities (**, P test) obtained by romjwmng 
various distributions with the theoretical frequencies obtained by fitting norma! 
curves to them. Those for the cephalic indices of series of Lanarkshire men for 
samples of 100 or more (see Pig. I, B and C) are in Table I. The lowest P found i« 
0-03, indicating that for samples of the size (100) actually drawn at random 
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from a normally distributed population, 1 in 113 would be expected to give a less 
good correspondence with the theoretical distribution than that exhibited by the 
sample in question. All the other P’s are much higher, indicating that the 
estimates provided by the samples bear a much closer resemblance to the normal 
form in these cases. 

TABLE I 

Probabilities (y 2 , P test) that the distributions of the cephalic index for different 
subsamples of 700 Lanarkshire, men indicate, that the character was normally 
distributed in the parent population 



Successive samples of 100 individuals' 11 

1st 

2nd 

3rd 

4th 

nth 

(itll 

7 th 

p 

0-02 

((■34 

()•«(• 

O'HH 

0*70 

0-03 

<>•23 



first 350 
individuals!' 

Second 350 
individuals!' 

700 individuals:]: 

P 

077 

<)• 114 

0*70 


* Distribution in 0 groups, t Distribution in 13 groups. { Distribution in lii I'roups. 

Table II gives the P’s for distributions of the cephalic index in the case of 
15 series of male skulls of the kinds which normally occur in anthropological 
practice, each series being made up by more than 100 specimens, These were 
selected merely because they happened to he the ones of the size required most 
easily accessible to the writer. The first i 1 are believed to he homogeneous in 
the sense that any subseries of them likely to he chosen for anthropological 
purposes would not show significant differences in any statistical constants, All 
but one of the P’s for these are high enough to provide excellent support for the 
hypothesis that the cephalic index was normally distributed in the parent 
population, The single exception is for an Egyptian series (3) which is known 
from other evidence to have statistical features which arc typical for “homo¬ 
geneous” samples, so the low P found for the cephalic index may well be 
attributed to the vagaries of chance selection. 

Series 12 and 13 are known to he heterogeneous to some extent, since the 
means of the cephalic index and some other characters for their subseries show 
differences which are significant though small. The P's for these two are both 
high. Series 14 and 15 arc known to represent populations which are unsuitable 
for the purpose of investigating group relationships, since they are exceptionally 
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variable. This is clearly indicated by the standard deviations for the cephalic 
index given in the right-hand column of Table U, and also by (how for certain 


TABLE 11 

Probabilities (y», P test) that the distributions of cephalic indices prmPkd % 
various series of mate skulls indicate that the diameter mu tmmalhj distributed 
in the 'parent populations 


No. of 
series 

Series* 

. _ i 

No. of ; 
skulls 

j 

Mean 

No. of 
gnml« 

Pt 

■i 

(fjj'Mt:, ,j 


(«) Series believed to be homogeneous 


J 

i 

Guaneho 

245 

7641 

11 

O90H 

2*3<t, (. pi 

2 

Eskimo (St Lawrence Is.) 

158 

77-1 

If) 

(HSH 

2 33 • m H , 

3 

Egyptian (20th-30th dyn.: Gizeh) 

866 

75'0 

16 

(Hll 1 

2*M »MOO * 

4 

New British 

114 

722 

11 

(him 

2 74 * '< in i 

6 

English (Camb, dissecting rooms) 

118 

76 8 

11 

0 157 

2 93 « U 19 i 

6 

Eskimo (Greenland) 

190 

71-3 

13 

0*694 

3 mi * o 15 j 

7 

Czech 

105 

82 9 

11 

0470 

3 IT j. 0*22 

8 

English (17th cent.: Whitechapel) 

131 

74*3 

7 

0 fffl 

3*26 « 0 20 

9 

German (Iteihengrttbor) 

220 

73*8 

13 

0-876 

3 36 } 0 16 

10 

English (17th cent.: Farringdon >St.) 

138 

75*4 

11 

0*137 

3 4s *" 21 

11 

Egyptian (18th—21st dyn,: Thebes) 

167 

75*1) 

13 

0*283 

3 .Vi | 0*19 


(6) Series showing small but significant differences 

lielwivti inimp uifut mthamrei 

12 

Egyptian (predynastic: Nmjiula) 

166 

72*7 

13 

; tnwfl 

2 80 - 017 ; 

13 

Swiss (Valais) 

463 

83*9 

It! 

0423 

4*01+0*13 ) 


(c) Heterogeneous mirks 


j 

k 

French (mediaeval and modem) 

1000 

79-6 

22 

1 (toot) j 

i 4*32 s (HO | 

16 

English (Bronze Age) 

| 1S1 

78*8 

12 

\ 0*660 

6-42 + 0*31 | 


[d] Artificial mixture of two series 


1 

2+12 

Eskimo + Egyptian 

321 

74*8 

16 

j 0*297 

j 

1 3-43 *0*14 1 

._ 1 


* The aeries are the ernes described in the billowing sources, which give the individual ntMitue* 
monts in the majority of cases i 


(I) Guanoho—Hooton (1025). 

(3) Egyptian—-Pearson & Davin (1024), 
(5) English—.Duckworth (1017). 

(7) Czech—Sohiff (1912), 

(9) German (pooled)—Morant (192ft). 

(II) Egyptian—Schmidt (1886). 

(13) Swiss-Pittard (1909-10), 


(2) Eskimo IlrdliSkn (IflSij. 

(4) New British Mnllw (ISOS). 

(6) Eskimo " Fuwt k Hanwu (IBIS), 

(H) English.Macdcmell (ItkH). 

(10) English—Hooke (1926). 

(12) Egyptian.Pawwtt (JM2), 

(14) French Topinanl (1888, 388). 


(16) English (pooled)—Morant (1926). 
f The P's were found from Table XII in Tables for Statistician* and BumBrieiam, Part I, the »* 
used being the number of groups, less 2 (i.e. degrees of freedom were taken „ number of group,. 

1CSS ijj* 

t 0'00003. 


other characters. One of these heterogeneous series (14) gives a P at* low as 
entirely to disallow for practical purposes the hypothesis that the index wait 
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normally distributed in the parent population, but the P for the other (15) is 
higher than several of those in section (a) of the table. The test thus fails entirely 
to distinguish between all suitable samples, on the one hand, and all unsuitable 
ones, on the other. Another example of its failure in this respect is recorded in 
section (d) of the table, the data there relating to an artificial mixture of two 
series having the very considerable difference of 4-4 units in their mean cephalic 
indices. The P of 0-3 obtained gives no indication of the strange origin of this 
sample. 

It is easier to ’point to series which are clearly unsuitable for purposes of 
classification than to any which would confidently be expected to be entirely 
satisfactory. A distribution which appears as likely as any to reveal the un¬ 
satisfactory nature of a aeries of the former kind has been provided by Herskovits 
(1930, p. 151). It relates to the thickness of the lips of 959 male adult “Negroes” 
in the United States. While a certain proportion of these men are believed to be 
of pure African origin, the majority represent varying grades of miscegenation of 
Negroes and members of other ethnic groups, with European ancestors pre¬ 
dominating but some admixture of American Indian blood, The character is 
obviously one which makes a marked distinction between the types of the two 
major groups of parental populations involved, The distribution for it gives a 
P of 0-536 (13 groups). Examples of the same kind relating to samples which 
are obviously unsuitable for purposes of group classification—such as mixed 
“white” samples in the United States, or artificial mixtures of series which 
differ markedly in some respects—might be multiplied indefinitely, and it is 
quite usual in examining such material to find distributions which give high P’s 
when fitted with normal curves. Trevor (1938) has observed this situation for 
all the characters he examined when dealing with data for living populations 
in different parts of the world derived from the crossing of European and non- 
European peoples. 

It is actually found that the vast majority of the distributions of measure¬ 
ments that occur in anthropological practice satisfy a test which shows the 
hypothesis that the character was normally distributed in the parent population 
to be not at all unacceptable. The high values of P usually found when compari¬ 
sons are made with normal curves fitted to the data do not demonstrate that the 
characteristic type of distribution for all but exceptional, and presumably 
unsuitable, populations is absolutely normal, but they are quite good enough 
evidence to suggest that the population distributions must be closely similar to 
imimodal and symmetrical forms. 

This generalization applies almost equally well, as far as is known, to all 
measurements, whether of shape or Bize, of series of living people or skeletons. 
Furthermore, it apparently applies equally well to series from groups of any size 
which are likely to be considered as populations by the anthropologist, Of the 
series of skulls dealt with in section (a) of Table II, some came from single 
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cemeteries while others were made up by combining from *-w*l 

scattered cemeteries. Maniples from small coimnunitiw of the |»r«n1iinl kind 
usually provide distributions which approximate to the form of tin* norma! 
curve neither more nor less closely than (to samples of flu* same size selected at 
random either from the populations of provinces or from thus** of ♦ oiintriw. The 
form of the distributions of metrical characters fails entirely to differentiate 
populations of very different sizes. 

The fact that a close approach to normality is almost invariably found 
suggests that this condition can safely be accepted m one whirl* must be 
satisfied by the distributions provided by samples before they rati be accepted 
as suitable for group comparisons, It has been seen, however, that tin* distri¬ 
butions of samples which are known to be unsuitable, for purfK*te« of clarifica¬ 
tion may also satisfy the condition. Hence the test must be supposed a nectwnry 
but by no means a crucial one. In the case of a particular sample the theoretical 
test might be that all characters should show reasonably high prnhahilif that 
the distributions in the population were normal. In practice', of eotirmt, the 
question can never be examined fur more than a small number of eharxrtem 
very seldom greater than off--but it in important to appreciate that the test 
should bo applied, in theory at. least, to all characters for which data arc avail¬ 
able, The demonstration that a single distribution indicate* a cWe approach to 
normality is no evidence that any other distribution for the same sample will do 
so, 

In general, useful anthropological results of a statistical nature mu only he 
derived from the conjoint investigation of data for several character*. 1 blterent 
characters are quite likely to suggest different conclusions, and the kind of 
information required has to be obtained by taking the evidence of a sufficient 
number into consideration. To take a hypothetical example, it may i«* amnrnsl 
that in the case of the crossing of two populations (d and II) the one derived from 
them (0) will only be likely to show any departure from the usual form of 
distribution-such as bimodality—in the case of those diameters which show a 
clear difference between the averages for A and U. Hut experience shows Hint 
the two parental groups arc likely to show dear difference in their average** for 
only a small proportion of the characters compared, even if ,-f and H belong to 
quite distinct ethnic groups. Hence peculiarity in the distributions for the 
hybrid population would be expected only in the case of a small proportion of 
the characters examined, and it is necessary to examine as many »« jHimdblo 
in order to ensure that such a peculiarity has not been overlooked, Furthermore, 
it should be appreciated that the chance of departures from normality being 
found is very different for different characters. This point is discussed below. 

A test which might be applied in practice could specify that the distributions 
of all metrical characters recorded for a particular sample examined should give 
P’s, when compared with normal curves fitted to them, greater than mrnie 
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arbitrarily chosen value— 0-001, say. If one or more of the distributions were 
found to give values less than the limit then the sample might be considered so 
peculiar that it should not bo used for purposes of classification. It is known, 
however, that many unsuitable samples would pass such a test, and hence it is 
of little practical value. Its value is lessened further by the fact that when a 
clear departure from normality is found the peculiarity of the distribution can 
nearly always be detected more easily on account of its abnormally large 
variation: this is so for series 14 in Table II. 

Much labour has been expended in making a detailed analysis of the distri¬ 
butions of characters provided by anthropological samples which are adequate 
in length for statistical purposes. Significant but slight degrees of skewness have 
sometimes been recorded. It has not been shown, however, that such examples 
provide any conclusions which are of the kind needed by anthropologists. Usually 
there is no guarantee that the samples were chosen absolutely at random, and 
.the slightly peculiar but erratic characteristics sometimes observed may well 
be due to biased selection. 

It has been shown that if the components of an index (AIB) are normally 
distributed, then the distribution of the index conforms more closely to a 
Pearson Type IV than to the normal curve. This topic lias been discussed by 
Merrill (1928) and Fieller (1932). A Type IV curve is unimodal and may be 
almost symmetrical and very similar in form to the normal. In fitting normal 
curves to a series of distributions it is generally not possible to observe any 
tendency for indices to give lower values of P than absolute measurements. 
Elderton & Woo (1932) made a detailed study of the forms of the distributions 
for a single long series of Egyptian crania in the oase of an unusual set of 
measurements relating to cranial bones considered singly. They concluded: 
“that the distributions of characters measured on the individual bones of the 
skull are not of normal type, but rather that the skewness and kurtosis of such 
distributions are peculiar to the individual measurement.” This is a point of 
considerable theoretical interest, though all the departures from normality 
observed were in fact slight. The conclusion that the population distributions 
almost invariably bear a closo similarity to the form of the normal curve in the 
case of all kinds of measurements appears to be sufficiently exact for almost all 
practical purposes. 

This is a result which anthropologists are very loath to accept in spite of the 
clear evidence of a large amount of material which supports it. The belief that 
markedly skew and bi- or multimodal distributions must be found was very 
generally held thirty years ago, and it has not been abandoned yet. The fact that 
expectations of this kind have been categorically denied to him is a matter of 
prime importance to the anthropologist, as it suggests that certain methods of 
analysis are quite impracticable on any valid statistical lines. He still fives in 
hope of discovering a method of dissecting a sample in such a way as to disclose 
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the sources from which it was derived. Judging largely from tin* prevalent nf 
unimodal and symmetrical distributions which offer no kco{» for rinwr-tfou. tmt 
also to some extent from other eharacteristiea nf wmipfcH to I jo wnmilrtwl. the 
statistician says that it is idle to entertain such a hope in dealing wish metrical 
data. He suggests that statistical tests may profitably be umi to distinguish 
suitable from unsuitable samples, hut that those selected for further use must 
be kept intact and somehow be compared with ttm* another as entities in order 
that conclusions may be deduced regarding the relationships and descent of the 
populations represented. 

The difference between the two points of view is fundamental. Repetition of 
the assertion that the vast majority of the samples with which they deal in 
practice do not indicate either appreciable skewness or more than one mode in 
the populations represented has had little effect in modifying the methods of 
anthropologists in general, or even of those who commonly u« statist seal 
methods. It is possible to go further, however, ami to offer an explanation of 
what appears at first to he a surprising uniformity in one characteristic of diverse 
kinds of material. 

It is an observed fact that the variabilities of all samples which ran be 
accepted far purposes of classification are very considerable and that they tend 
to be similar in the case of all populations for which adequate data, are available, 
the earliest of these having existed in Egypt about fitKKl »,c\ Absolute equality 
in the populations is not indicated, but intraradal variabilities all lit* w ithin a 
fairly restricted range in the ease of any particular character. Evidence in 
support of this statement will be provided in the following flection of this paper. 

In the case of a mixture of two populations, a distribution would only be 
expected to be bhnodal if the difference of the means for the two population* 
happened to be appreciable compared with the two intragroup variabilities. An 
estimate of the chance that the difference in question is likely to be appreciable 
will be given by the ratio of interracial to intrarackl variabilities, and tins is 
likely to be different for different characters. It has usually Iwen assumed that 
the differences between the types of populations defined by average measure¬ 
ments are often large compared with the differences Imtweon individuals 
belonging to the same population, but though this situation may be Into for 
skin colour it is not observed in the awe of any metrical character for which 
data are available. With the evidence of skin colour in view, anthro|«dogi«ta have 
always been inclined to over-estimate the interracial variability ami under¬ 
estimate the intraracial variability of other characters. The evidence of measure- 
meats corrects this tendency and shows that for metrical characters the situation 
is markedly different from that which has often been postulated. 

Examples relating to particular characters will now be considered. Pig. 2 
shows the distributions of stature for three series of men, The first relates to a 
tribe of Congo pygmies (Ete) recorded by gebesta & Lebaelter (1»33): it has a 
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mean of 1430 mm., which is very close to the smallest given for any people in the 
world, and a standard deviation of 514. The third relates to the Dinka tribe of 
Nilotic negroes, from unpublished data of 
A. MacTier Pirrie,* andits mean of 1804mm. 
appears to be the largest on record, the 
standard deviation being 79-7. The second 
distribution relates to men from a province 
of Japan recorded by Matsumura (1925), and 
it has a mean of 1622 mm., which is almost 
midway between those for the extreme 
series, and a standard deviation of 53-1. 

There is seen to be an absolute separation 
between the ranges of the statures for the 
pygmies and the Dinkas, the tallest member 
of the former group being shorter than the 
shortest of the latter, though it is probable 
that the distributions for larger samples 
would overlap to some extent. A mixed 
series made up by taking 100 pygmies and 
100 Dinkas would obviously bo bimodal. 

A distribution which would clearly be bi¬ 
modal wouldalso be providedif the Japanese 
men and the pygmies, or the Japanese men 
and the Dinkas, were takpn together. If 
mixtures were made up by taking pairs 
of series at random from all populations 
in the world, however, the chance of getting a particular pair with means 
differing by as much as 182 mm. ( = Dinka-Japanese mean) is quite small. 

This point is illustrated by the interracial distribution given at the bottom of 
Fig, 2, which was compiled by taking all the means for series of 30 or more men 
collected by Denikor (1026) and representing populations in all parts of the 
world. The mean of the distribution is 1646 mm. and the standard deviation 58'H. 
It can be shown that if pairs of values (i.e. means for series) are drawn at random 
from it, then the probability that they will differ by less than 80 mm. is 0'56, by 
less than 120 mm. 0-80, by less than 160 mm, 0-01, ancl by less than 200 mm, 
04)0. It is also known from experience that a mixed distribution of statures made 
up by combining two samples of equal sizes from populations in which the 
character is normally distributed, and which differ in their means by 80 mm, is 
quite likely to give a P on comparison with a normal curve fitted to it which 
would he high enough to make the incorrect hypothesis thus tested acceptable if 

* I am indebted to Dr Otto Samson, who is working on the material, for the statures of the 
D'nkas, The records were kindly lent by Prof. J. C, Brash. 
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Stature in cm. 


Pig. 2. Intraraoial distributions of statures 
representing an extremely short, an 
intermediate and an extremely tall 
group, and an interracial distribution. 
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the source of the material were unknown. With any appreciably wider sopamf it in 
of the means for the two component series all l y * found would pr»*Imltiy hr m 
low as to indicate clear departure from normality. The position may He summed 
up roughly in this way in the case of stature: il each of a number of sample* rd 
unknown origin was actually a mixture of t wo samples of equal sizes rvprmmf mg 
pairs of populations chosen at random from all in the world. then Onfarm <>f lIn 
distribution alone ,, in the ease of about half of them, would not He miftieiimtly 
peculiar to reveal dearly their distinction from samples representing single 
populations, but in the ease of the other half jHwulismty in origin would be 
indicated. In the same circumstances distributions winch were definitely 
bimodal would be expected to occur occasionally. 

Actually, however, such forms are scarcely ever found in practice, and 
hence it must be inferred that the condition postulated is of an artificial kind. 
In fact the anthropologist is not at all likely to choose a sample so badly that it 
represents what may he called a purely “mechanical“ mixture ol two distinct 
populations which differ appreciably in their averages for one or more dm 
racters. He is far more likely to come across a sample representing n population 
derived from the partial or complete blood admixture of two groups which wen* 
originally distinct. In such cases it is known that the mean for the mixed 
population will lie between those for the parental groups. Trevor found 

that the distributions for groups derived from the crossing of distinct ethnic 
stocks usually give high P’a when compared with normal curves fitted to them 
Samples occurring in anthropological practice may relate to diverse kinds of 
conglomerations, as it were, of mechanical and blood intermixture of population* 
which would best bo considered singly in examining group relationships, In arn-h 
cases the forms of the distributions of metrical characters will usually fail 
entirely to indicate the composite nature of the assemblage. A sample of the 
"white” population of the United States consisting of recruits w»« found to give 
a distribution of stature which could be adequately represented by a normal 
curve (Hoffman, 11)18), 

Anthropologists find it hard to accept the conclusion that tlmir frequency 
distributions for heterogeneous material never give any clear indication of the 
component parts of a particular sample. This situation is observed kn-iuiw the 
interracial variation of metrical characters is of the same order m, or smaller 
than, iritraraoial variation. The situation has been considered in the cane of 
stature, and for this character the interracial distribution gives a standard 
deviation of the same order as those normally found for intraraeial samples. 
These questions have been investigated most fully for certain meanurcunentw of 
the skull which are more reliable than the vast majority of the measurements 
available for series of living people, Data similar to those illustrated in Fig. 2 
for stature are given in Table III for three cranial characters. Tim material 
collected relates to series of adult male skulls madeup by 30 or more specimen* and 
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representing populations, believed to be suitable for purposes of classification, 
which have existed in various parts of the world from about 5000 b.o. to modern 
times, In the case of each of the three characters considered the distribution for 
the series with the lowest mean is given first, followed by the distribution of 
racial means and then that for the scries with the highest mean. 


TABLE III 

Distributions of three cranial measurements, each for an intraracial sample with 
an extremely low mean (L) and one with an extremely high mean (//), and for 
an interracial sample (I): male skulls 


Horizontal circumference 


Poraminal index 



2-231 4-14 I 4-11 


•11 15-1 


10-1 15-7 


7-05 Hi!) fW) 


* Loyalty Inlander (iSanwin ft Roux, liilli 22), 
t Tolmghitr (Koichcr, 1013). 
j Raining (New Britain) (Bauer, 1015). 


§ HHhcngriik-r (Morant, 1D2H). 
I! Tanganyika (Hied, 11115), 
r Vew7.ueli:in (Mnrmnn, lHiHl). 


The situation is seen to be much the same for the cephalic index as for 
stature, the interracial variability being rather larger than that for nearly all 
samples which can be accepted as suitable for purposes of classification. The 
distributions for the series with extreme means cover contiguous ranges, and 
there is no overlap. Of all the cranial characters for which adequate data are 
available the cephalic index is quite outstanding in this respect. For most of 
them the situation is very similar to that illustrated by the data in Table III for 
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the horizontal circumference of the skull. In this raw, the standard deviation 
for the interracial distribution is appreciably tow tlmn any intraracial value found, 
and the extreme distributions show considerable overlap, Tim limit in tin* other 
direction is exhibited by the index expressing the breadth of the foramen 
magnum as a percentage of its length. In this easts interracial variation is much 
smaller than intraracial variation, and the interracial distribution falls entirely 
within the range for any intraracial sample. For most crania! diameter- < he 
averages for all populations in the world are quite likely to fall within the range 
provided by any population chosen at random. 

It is evident that the chances that a heterogeneous sample will exhibit a 
distribution diverging appreciably from the normal form are very different for 
different metrical characters. As far as is known, stature anti Hit* cephalic index 
are the two most likely to show departures from Urn rule in the ease of mixed 
samples, since the ratio of inter- to intraracial variation is largest, for them, For 
this reason examples relating to the two characters have been given above, 
Most cranial measurements are decidedly less likely to reveal heterogeneity by 
the forms of distributions obtained for them, as the difference Mweeu the 
means of any two samples inadvertently combined compared with the two intra¬ 
group variabilities will be, cm the average, appreciably less for them. P«r some 
cranial characters interracial variation is so small compared with intraracial 
variation that no heterogeneous samples will be at nil likely to provide js'cnlinr 
distributions. 

In view of the circumstances discussed above, it is not surprising that the 
distributions which occur in anthropological practice scarcely ever show any 
appreciable departure from a symmetrical and unimodal form. Detailed st atisti¬ 
cal analysis of the slight peculiarities of the distributions sometimes observed 
is never likely to be profitable in tiie ease of small samples mnjmml of fewer 
than 300 individuals, say—-and in general it does not lead in the ease of ade¬ 
quately large samples to any conclusions which aid the elsnwitimtiim of the 
groups represented, 

The normal curve may he accepted as the one which almost invariably gives 
an adequate description of the distributions in the populations sampled, even 
though some of these populations are unsuitable for the purpimo in view, It is 
usually found to effeot this purpose remarkably well, in spit* of the fact that the 
theoretical ourve can never bo the correct one ns it is of unlimited range, In this 
application it is safest not to draw any inference* regarding the composition of 
a population from the observed fact that its distributions are normal in form. 
Very occasionally a clear departure from normality can be taken to indicate t hat 
a peculiar and unsuitable population is represented. 

For practical purposes the most important conclusion to be derived from the 
forms of the distributions of anthropometric characters is that dissection of the 
samples treated into subsamples which might be supposed to represent groups 
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of different origins is impracticable. Anthropologists in general are unwilling to 
accept this conclusion, Other characteristics of the samples, such as their 
variabilities and correlations, also have a bearing on the question and hence 
discussion of it will be deferred for a later part of this paper. 

The writer is indebted to Miss M. L. Tildesley, Prof, E. S. Pearson and Prof. 
G. von Bonin for criticism which led to the improvement of this first part. 


APPENDIX 

Certain distributions of cephalic indices provided by 
Felix von Luschan 

As far as the writer is aware, the only published distributions for any anthropometric, 
character which show a clear departure from normality, and which rolato to unsoloctcd 
samples made up by moro than 100 individuals, arc three provided by Felix von Luschan, 
These arc often referred to as ovidonco that such departures do occur in practice, though 
usually without comment on tho rareness of such an occurrence. All the distributions in 
question aro for tho cephalic index, and they all relate to groups of mon nabivo to the 
Near East. 

Tho first is for 179 mon who called themselves Greeks measured in Lycia and neighbouring 
localities in the south of Asia Minor, including a few islands. Tho figures for tho cephalic 
index wore published (von Luschan, 1990). A frequency diagram in tho same paper docs 
not agree with those figures, as it shows one index of 95 while tho highest in the table is 94, 
This error was perpetuated when the distribution was reproduced in two later publications 
(von Luschan, 1911, 1927). The frequencies are shown os histograms in tho top diagram of 
our Fig. 3 A. 

The socond scries is made up by 7 GO Turks from the south of Asia Minor and tho north 
of Syria. Tho figures for 187 of those men arc available (1890) but not those for tho re¬ 
mainder. A frequency distribution “reduced to ono-third” was given (1911, 1927), lb is 
not possible to read off tho individual frequencies from this with absolute accuracy, but tho 
middle diagram of Fig. 3 A reproduces thorn as closely as possible. With reference to this 
distribution, von Luschan says (1911, p, 236) that it ranges from 69 to 90, but tho highest 
valuo given in his diagram is 92. 

The third scries is of 1222 Jews: “62% of those wore Sephardim, whom I measured at 
Smyrna, at Constantinople, at Makri, and in Rhodes; the rest wore Ashkenazim measured 
by myself.at Vienna, Austria" (1911, p. 226). A frequency distribution “reduced to 
onn-fifth ’’ is given and this is reproduced as accurately as possible in tho bottom diagram of 
Fig. 3 A. 

The first and third of these throe distributions are cloarly bimodal in form. That for tho 
750 TurkB provides less clear evidenco of tho existence of more than one mode in tho 
population distribution, but it certainly suggests that this must have been clearly asym¬ 
metrical. The three distributions provided by von Luschan relate principally to populations 
of Asia Minor, and thoir peculiarities might be supposed due to the fact that this region is 
inhabited by peoples who are racially heterogeneous to an unusual extent. In fact, how¬ 
ever, other series from it do not appear to be at all distinguished by any unusual forms of 
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The corresponding pairs of series are also distinguished clearly by their variabilities. The 
following standard deviations arc found: 



Greeks 

Turks 

Jews 

Von Luschan's series 
Other series 

0-96+0-37 

3-21+0-19 

5-20 + 0-13 

4-70 + 0-16 

6:72 + 0-14 

2-90 + 0-17 


In comparison with other material, the abnormally high standard deviations of the 
cephalic index for von Lusohan’s three scries show at once that the samples are of a peculiar 
kind unsuitable for, racial comparisons, This material is altogether exceptional and it does 
not affect tho general conclusion that the distributions of metrical characters for samples 
treated in anthropological practice almost invariably indicate that the population distri¬ 
butions do not diverge appreciably from the normal form. 
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1. Introduction 

The human skeletal remains which form the subject of this report were excavated 
by the Wellcome (later the Welloomc-Marston) Archaeological Research 
Expedition to the Near East from 1033 to 1936. They were collected by the late 
Mr J. L. Starkey, who was then the Director of the Expedition, and it was 
owing to his enthusiasm and untiring efforts in the field that so much material 
is now available for study. His tragic death in 1938 deprived archaeology 
and physical anthropology alike of one who was chiefly responsible for the 
collection of some of the most valuable evidence extant relating to the early 
history of the peoples of the eastern Mediterranean. 
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100 Cranial and other Unman Unmans from Pnlrstim 

The human remains from Mil'll were brought to England. ami the greater 
part of the work on them was carried out at the ('niton Laboratory, f nivejxify 
College, London, under the supervision of Dr (!. M. Mutant. I am indebted to 
the Trustees of the lute Sir Henry Wellcome for a grant which enabled me to 
carry out this work over a period of rather more than three years, and ulwi for 
a subsidy which has made possible the complete presentation of the illustrative 
and tabular matter collected for the report. 1 also owe a very great debt ol 
gratitude to Dr Morant for his untiring help and advice in ail the stages of its 
preparation. A complete report of the excavations in in ennrse ol preparation, 
and I have to thank Mr Charles luge., the present Director of the Expedition, 
for most of the following particulars relating to the discovery of the bones. 

The only published account of them is a note by Mr Starkey illtUbj, prefaced 
to a description of three trepanned skulls by Dr Wilson Parry. He reports t hut 
a roughly circular chamber (No. 11)7), which contained a deposit id human 
remains much damaged by (ire, was opened in lb.‘If. At the same time mt 
adjoining and larger rectangular cavern (No. 12o) was located. Starkey writes: 

“The top layer consisted of many animal hones, mostly pig, and tin-, relnne 
should he ascribed to the latter lmlf of the .Judean kingdom... .The lower or 
main deposit consisted of a mass of human bones, the remains of at least 1 .'mil 
bodies, As they were pitched in through the hole in the broken naif, the skulls 
rolled down from the apex of the pile to the sides of the clmmlwr.’* 

Shords of pottery intermixed with the bones can all be assigned to the 
seventh and eighth centuries me. 

"Some bones wore partially calcined, suggesting that they were abstracted 
from burnt buildings.... Careful supervision of Hie eleivranee failed to establish 
that any crania were in articulation with vertebrae, and the jaws were rarely 
attached: in fact, no order was seen in the jumbled maw." 

It is suggested that the ossuary was probably connected with the salvage of 
Lachish after its partial destruction by Sennacherib, King of Assyria, in 7»i iu\ 

"When floor level was reached it became clear that the tomb had been 
previously used as a dwelling, a door had been (rut at the N.K. {given errant «u«ly 
as N.W, in the published account] corner, connecting the smaller circular tomb 
with it, which contained the 500 bodies discovered in 1004. From the style of 
both chambers it is certain that they were originally excavated to emit mu early 
fifteenth-century burials,” 

Excavation of the area was extended and two other tombs were found, of 
which no details have yet been published. Fig. I is a plan of the area in quest ion, 

showing all four tombs from which the human remains treated in this study were 
obtained, 


The archaeologists report that: 


All the tomb chambers were adjoining and interconnected, and the 
deposits mNos. 107,108, and 120 were identical and as described by Mr.Starkev. 
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Tomb No, 110 is in a slightly different category; it is an artificial cavern, with 
a small entrance cut in the south wall of No. 108 near the bottom, containing 
normal burials comparatively undisturbed. The period however is the same.” 

It is said that the later discoveries do not provide any evidence which makes 
closer dating possible. They include a Cypriote juglet, of a type common in 
Palestine in the eighth century b.c., scarabs and scarahoids, bone pendants, 
and blue glaze amulets. The last include representations of Egyptian deities 



Pig. 1. Plan of the tombs at hashish from which the human remains were recovered. The broken 
linen indicate approximately thelloorareaon'omb 120, and theoutlineoftho benches (inner line) 
and the maximum perimeter of Tomb 11(1. The approximate heights of the tombs wore: 
107, 2 m.; 108, 2-2 m,; 11(1, HI m.; and 120, 3-D m, 

which were long popular in Palestine. There is nothing in the objects that one 
would be surprised to find in normal burials at Lachish in the middle of the 
Jewish period. 

Eurther particulars regarding the state in which the human remains were 
found are available. In all the tombs, except No. lift, there were distinct layers 
of animal bones above the human deposits, and in the shallower part of Tomb 108 
they came up to the roof level, the roof itself having been broken away. In 
Tomb 107 the apex of the pile of human bones was about l m. above the floor 
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level, and the same height wan about 1*3 in. in Tomb liKi. Then* wore fewer 
human remains in Tomb 108: these were more scattered, and then* were many 
animal bones above them. 

Tomb 116 is in a different category from the other*, for it remained several 
formal burials laid out on the benches, with the usual funerary equipment. 
These burials were partially disturbed, and the skulls could not be. distinguished 
from others which may have rolled in at the time of the filling of Tomb UW, 
The roof of Tomb 118 was intact, and most of the tong bone* collected come, 
from it as they were the best preserved. 

The estimate that the remains of 1500 individuals were represented applies 
to Tomb 120, and it is admittedly very rough, ft was impossible to judge 
whether the piles of bones included the complete skeletal remains of 15no 
people or not, but the impression was gained that the skeleton of the head was 
far better represented than that of any other part of the body. A few mandibles 
attached to crania were found, but there appeared to Ik* no other eases of parts 
of the skeleton existing in proper articulation. A suggested explanation of the 
circumstances which led to the existence of the ossuary is given in the following 
section of this paper, as evidence derived from the bonus preserved lots a bearing 
on this topic. 

Plates 1 and II reproduce photographs taken in the interior of Tomb 121) 
after the clearance down to floor level. The piles of skulls which had amumilated 
round the walls are shown in situ. Plate II it is a close view of a small group of 
skulls seen to the left of the beam in Plate I a and above the pick in Plate l it. 

Other human remains from a number of tombs at Whiah, representing 
fewer than 100 individuals in all, have been preserved, and the writer is pre¬ 
paring a separate report on this material. 

2. The general nature or the remains and remarks 

ON THE ORIGIN OF THE OSSUARIES 

The human remains described in this report comprise all that were preserved 
from the four tombs referred to in the foregoing section. They arc made up 
principally of crania, which appeared to be more abundant in the deposits than 
any other parts of the skeleton, but there are also scries of mandible* ami long 
bones of the limbs, and a few other bones, A selection was mark* of (he hotter 
preserved remains, and preference was given to crania. It should }«* maimed 
that the relative numbers of different parts in the collection preserved canned be 
supposed to be proportional to the frequencies with which they actually occurred 
in the ossuaries. 

The specimens were dipped in paraffin wax in the field, and the cleaning of 
them m the laboratory was a laborious task. While it was being carried out. four 
crania distorted by earth pressure were found, as well as a certain number of 

ragments of other parts of the skeleton which may have belonged to individuate 
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included in the series, or to individuals not otherwise represented. All this 
material was discarded, and it is not counted in the totals. The numbers of 
specimens, most of which are more or less incomplete, from each tomb are given 
in the table below, no distinctions of sex or age being made: 



Tomb 

107 

108 

116 

120 

Totals 

Crania 

74 


46 

607 


Mandibles 

12 


7 



Femora 

2 


60 

28 


Tibiae 

2 

1 

27 

15 

45 

Fibulae 

— 

— 

1 

— 

1 

Humeri 

2 

— 

23 

23 

48 

Radii 

1 

— 

6 

8 

14 

Ulnao 

3 

3 

3 

8 

15 

Sacra 

— 

— 

— 

2 

2 

Clavicles 

4 

— 

2 

l 

7 


There are also a few vertebrae associated with skulls. 

The absence of other bones of the skeleton which would normally be well 
preserved, such as those of the wrist and ankle, should be noted. In this con¬ 
nexion Mr Inge remarks: “Less attention was paid to the skulls of children 
owing to their supposed smaller anthropometrical value, and the same applies 
to parts of the skeleton other than skulls and long bones." 

None of the bones are known to he associated, except that ten mandibles 
and crania were preserved together, and are undoubtedly paired. It is probable 
that a considerable proportion of the other mandibles do belong to the crania, 
but it is not possible to associate them. Experience has shown that any attempt 
to pair parallel series of these parts is unprofitable, It was also impossible to 
associate the other bones with the crania or with one another. 

After separating the immature specimens, the crania and mandibles were 
sexed anatomically, giving the totals shown in the table below. Estimates of 





Tomb 


Totals 

107 

108 

110 

120 

Crania 

Adult $ 

33 

5 

10 

303 

360 (61-8%) 


Adult ^ 

31 

3 

25 

216 

274 (39*4%) 


Immature 

10 

1 

1 

40 

61 (8-8%) 

Mandibles 

Adult <J 

5 

0 

3 

26 

34 (44-7%) 


Adult $ 

7 

1 

4 

24 

36 (474%) 


Immature 

0 

0 

0 

6 

6 (7-0%) 
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this kind cannot be exact in all cases, of course, but experience suggests that 
80-90% are likely to be correct. The inclusion of a certain proportion ol male- 
like, but actually female, specimens in the male series, or vice versa, is not 
likely to affect statistical constants appreciably. It has been shown by Martin 
(1936) and Cleaver (1937), that isolated mandibles can be sexed anatomically 
as accurately as crania. Particulars regarding the other bones ol the skeleton are 
given in § 10 below. 

The proportions of the sexes and of immature individuals are. seen to be 
very much .the same in all the tombs. The corresponding male and female 
percentages in the table are not particularly close, but the series ol mandibles 
is obviously too small to give reliable proportions. It is safe to assume that 
there are rather more males than females in the collection preserved. This does 
not necessarily imply that the same was true for the total number of individuals 
deposited in the tombs, as the male skulls, being stronger cm the average, would 
stand a slightly better chance of being preserved than female specimens. It. is 
probable, for a similar reason, that the proportion of immature individuals w as 
very considerably higher in the tombs. The proportions of men, women and 
children there appear to have been very similar to those which would lie expected 
in a normal cemetery population. 

Estimates of the ages at death of the adults are discussed in § 3 below. It is 
shown there that the population was considerably younger than such as is 
normally found in ancient or recent cemeteries. There are very few aged njieei- 
mens in the collection. 

The condition of the bones throws little further light on the question of how 
and why they were deposited in the tombs, but the following observations may 
be noted in this connexion. There is only one skull (No. 10K, Tomb 120) with 
wounds which are likely to have been the cause of death, This has pieces of 
bone removed from both parietals (see Plate III b), and a long cut extending 
across the left temporal line. The other lines on the left parietal and extending 
across the sagittal suture, along which the bones are broken, may have, been 
made at any time after death. As far as can be told the damage shown in the 
case of all the other speoimens occurred in the tombs, and after the death* of 
the individuals. The greater part of it may be attributed to the crushing weight 
of the piles of bones,' 

Among the totals for all tombs combined, there are twenty-nine (K*t%) 
adult male, eight (2-9%) adult female, and two (3-3%) immature skulls with 
wounds, mostly slight, which had healed completely during life. These fre¬ 
quencies are not higher than those commonly found for excavated series of 
crania. 

In his account of the discovery of the remains, Starkey notes that those 
in Tomb 107 were much damaged by fire, and that some bones in Tomb 120 
were “.. .partially calcined, suggesting that they were abstracted from burnt 
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buildings”. It is probable that the great majority of the burnt specimens were 
too damaged to be worth preserving. In the series preserved there are only 
three skulls with burnt patches.* No. 485 ($, Tomb 107) has an extensive area 
on the occipital and right parietal bones affected (see Plate III a), No. 537 
(?, Tomb 120) shows a smaller area on the occipital bone, and No. 681 (im¬ 
mature, Tomb 120) shows a small burnt patch above the right orbit. 

It may be noted, too, that there are three trepanned skulls (all males from 
Tomb 120), and a few artificially deformed skulls from the same tomb. There 
are two male skulls and one female skull which show definite signs of diseased 
conditions affecting the bone. Again, they all come from Tomb 120 but it must 
be remembered that the series from this tomb is far longer than that from any 
of the other tombs. 

The evidence relevant to the provenance of the human remains from the 
four tombs at Laohish consists partly of archaeological observations, and partly 
of deductions from the remains preserved. It must be remembered that the 
series available were selected from a larger bulk of material. Taking the four 
tombs together, the total remains in hand must represent rather more than 700 
individuals. It was estimated that those of about 1500 were present in Tomb 120 
when it was opened. As the dimensions of the piles of bones are known approxi¬ 
mately, it may be asked whether they could possibly have been made up by 
bodies, or skeletons, which were complete when deposited. Taking Tomb 120 
alone, the dimensions of the pile were roughly: height 1‘3 x 7-5 x 8-5 m. (see 
Fig. 1), giving a volume of about 83 cu.m. It is possible that at least 1500 
commute, skeletons, including some of children, were interred in a space of this 
size. The tomb was nearly three times as high as the pile of bones, and hence it 
is also possible that it could have contained a heap of 1500 complete bodies, 
which might be supposed to have subsided on disintegration. The animal bones, 
which formed a distinct layer over the human remains, may have been thrown 
in after such a subsidence. So far there is nothing to preclude the possibility 
of the interment having been either of bodies, or of complete skeletons, such as 
might have occurred in clearing a neighbouring cemetery. 

The hypothesis that the remains were obtained from a cemetery—so that 
Tomb 120, and possibly Nos. 107 and lOHaswell, were really clearance pits—seems 
to be unacceptable for several reasons. In the first place, it is almost invariably 
found that among bones cleared from cemeteries, crania and femora are far- 
better represented than any other parts of the skeleton, and in such a case a 
pile as large as that in Tomb 120, say, would have been likely to contain far 
more than 1500 skulls. After clearance, too, it would have been unlikely that 
any skulls and mandibles would have been associated, though a few pairs were 

* It was found that the paraffin wax in which the bones had been dipped could be removed 
moat effectively with the aid of a flame, and although great care was exercised, a few specimens 
now show signs of slight burning, chiefly at the base of the skull, acquired in the laboratory. 
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found. Another argument against cemetery clearance is that few skeletons of 
children are likely to have been thrown in the tombs, on the hypothesis eon* 
sidered, and practically all these would probably have been too crushed in the 
pile to be worth preserving. Of the total number of skulls in the series, 8*8% 
are immature, including a fair proportion of children under It) years of age, and 
it is unlikely that this frequency would have been so high after two st ringent 
selections of the material. The scarcity of aged specimens also tolls against the 
hypothesis of clearance from a cemetery. 

In view of the fact that the skeletons of children are far more easily damaged 
than those of adults, and that bones of women are more easily crushed than those 
of men, it is not at all unlikely that the proportions of immature to mature, and 
of female to male, skeletons in the tombs, were very similar to those of the liivuj 
population of Lachish at a particular time,. The rarity of aged specimens must 
be remembered, These facts seem to tell against the theory that the individuals 
in the tombs were massacred, but a stronger argument against it is t lie fact that 
only one skull showing injury which might have boon the cause of death is 
recorded. 

The most plausible explanation, which appears to he in accordance with all 
the evidence, is the following, It may be supposed that some catastrophe, such 
as pestilence or earthquake, overtook the population of Lachish about the year 
700 b.c., and that a large proportion of the inhabitants were victims. Ordinary 
burial at such a crisis would have been impossible, and in clearing the. town, 
some time after it, the underground chambers in question would have been 
convenient depositories into which bodies could have been thrown. The fact 
that some were burnt is not surprising, as the catastrophe may have been 
accompanied, or followed, by a fire in part of the town. The fact that the age. 
distribution is appreciably younger than would be anticipated for any cemetery 
population of the period is accounted for on the hypothesis considered. 

3, Remarks on the condition and anomalies (other than dental) 
oe the Lachish crania 

Remarks on the Lachish skulls are given in the appended tables of individual 
measurements. Owing to the incomplete nature of many of the specimens, the 
totals observable in the case of particular anomalies are generally considerably 
less than the totals for the whole collection, and hence they have to be given 
separately in considering the frequencies of different conditions. The total 
numbers of the skulls from different tombs divided into sexes for the adults, and 
the numbers of immature specimens, are given on p, 103 above. In general, the 
frequencies are considered separately in the case of the long series from Tomb 
120 and of the series made up by combining the skulls from all the other tombs, 
subdivision of the latter is inadvisable, as the numbers from all the tombs other 
than 120 are small. 
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Sutures. Remarks on the coronal, sagittal, and lambdoid sutures are 
given in the tables of individual measurements. If no mention of one of these 
is made there, this indicates that it shows no sign of closing. Every specimen is 
complete enough to observe the three principal sutures. The approximate 
estimate of the age constitution of the series, which can be obtained from these 
observations, will be considered first. Frequencies are given in Table I. If 
a suture showed any sign of the beginning of closure, the skull was counted in 
the second category (sutures beginning to close or partly closed). A suture 
was counted as closed if synostosis was apparent for at least the greater part of 
its length. 


TABLE I 

The age constitution of the adult Lachish and other series estimated from, the state 
of closure of the •principal calvarial sutures ( coronal , sagittal, and lambdoid)* 


Series 

All HuturcH 
open 

Sutures 
beginning to 
close or 
partly closed 

All sutures 
closed 

Totals 


Laebish, Tomb 120 

Other tombsf 

Total} 

Kerma, 12th and 13th dyn. 
Glitch, 20th-3Oth dyn. 
Farringdon St, Londoners 
Whitechapel, Londoners 
Spitftlfiolds, Londoners 

Hythe 

123 (43'3%) 

20 (45-0%) 
141) (43-7%) 

>6(11*3%) 

74 (37-0%) 

HI (12-2%) 

21 (154%) 
11)3 (19-3%) 

23 (2(H) %) 

153 (534)%) 

28 (494%) 
181 (534%) 

83 (58-0%) 
103 (51-5%) 
UK) (644%) 

85 (02-5%) 
301 (60-3%) 

62 (504%) 

« (*«%) 

3 (5.2%) 

11 (3-2%) 

42 (20-8%) 
23(11-5%) 

37 (23-7%) 

30 (22-1%) 
131 (24-5%) 

25 (22-7%) 

284 

57 

34 L 

141 

200 

150 

130 

535 

110 

9 

Laehish, Tomb 120 

154 (744)%) 

54 (204)%) 

1) — 

208 


Other tombs 

45 (70-3%) 

14 (23-7%) 

() „ 

59 


Total 

lit) (74-5%) 

08 (25-5%) 

0 - 

207 


Kerma, 12th and 13th dyn. 

40 (354%) 

51 (44-7%) 

23 (20-2%) 

114 


Farringdon St, Londoners 

04 (40-3%) 

80 (31)4%) 

20(14-3%) 

203 


Whitechapel, Londoners 

00 (4(1 , 8%) 

40 (344%) 

20(184%) 

141 


finitnl fields, Londoners 

121 (4114%) 

04 (384%) 

30(12-2%) 

246 


Hythe 

51 (50-3%) 

211 (30-2%) 

0 (10-5%) 

80 


* The data in this table for all the series other than the Laehish are taken from Ktoewiger & 
Morant (11)32, p, 17(1) and Collett (11)33, p. 251)), 
f Comprising Tombs 107. 108, 11(1. 

} The posthumously and artificially deformed Laehish skulls, and those not included in the 
pooled series on account of the fact that they show premature dosing of the sagittal suture, as 
well as No. 380 (premature, dosing of the coronal suture) and No. 382 (of unusual form), are not 
included in the totals given in this table. 


It should he note.d that all the data relating to sutures refer to their ecto- 
eranial aspects, The endoeranial surface of the brain-box could only be observed 
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in a few specimens, as cleaning was impossible, and hence no remarks on the 
inside of the skull were recorded. 

The division of the adult skulls into the three categories shown in Table I 
was made with the primary purpose of obtaining a rough estimate of the 
distribution of age of death for the individuals whose skulls are preserved. It is 
known that there is considerable variation in the ages at which the, different 
sutures close; a skull with the sutures partly closed, for example, may well 
have belonged to an older individual than one with all sutures open. The actual 
age distributions for the separate groups, if they could he known, would doubtless 
show very considerable overlap. In spite of this it is safe to assume that the 
percentages may fairly be compared with those given for another series observed 
in precisely the same way, in order to determine differences in the age. constitu¬ 
tions of the samples. The data for the other scries given in the table were 
actually collected by Dr Morant, who confirmed the fact that my records had 
been obtained in a comparable way. 

There are very clear differences between the percentages for the lav-lush 
skulls and those for the other series, while the corresponding values for Tomb 12U 
and the combined series from the other tombs are remarkably close, The usual 


TABLE II 

The frequencies of different orders of closing of the principal calvarial 
sutures for the Lachish skulls (all tombs)* 


Sagittal dosing first, coronal and Iatnbdoid open or closing together 

Sagittal closing first, coronal second and iamhdoid last 

Sagittal closing first, lambdoid second and coronal last 

Coronal closing first, sagittal and lambdoid open or closing together 

Coronal closing first, sagittal second and lambdoid Inst 

Lambdoid dosing first, sagittal and coronal open or dosing together 

Lambdoid closing first, sagittal second and coronal last 

Sagittal and coronal closing together before lambdoid 

Sagittal and lambdoid closing together before coronal 

All threo sutures dosing together 


Totals 


* 


-J 

it 

7 it 

23t 

21t 

3§ 

17;! 

2 

21*! 

24 

3 

0 

2 

1 

2 

•1 

37 

12 

11 

1 

7 

2 

1 2H3 

, 

74 1 


* Excluding the posthumously and artificially deformed, but including the eleven imltt and 
six female skulls sho ving premature closing of the sagittal suture, and No. M), showing premature 
'closing of the coronal suture. 

t Including eight showing premature closing of the sagittal suture. 
t Including five showing premature dosing of the sagittal suture, 

§ Including one showing premature closing of the sagittal suture. 

1! Including three showing premature closing of the sagittal suture 
f Including No. 380. 
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sexual difference, due to the fact that the sutures close at a later age for females 
than for males, is found. It is evident that the average age at death must have 
been considerably less for the Lachish adults than for any other of the groups of 
people. All the other series, except the Whitechapel (obtained from a burial pit, 
probably used in time of plague), are believed to represent ordinary cemetery 
populations. 

Statistics relating to the orders in which the three principal calvarial sutures 
were closing are given in Table II. It was possible to observe these orders in 
the case of all the adult skulls for which closure had commenced, except one 
aged male (No. 357) in which all three sutures are completely obliterated. The 
special series of adult specimens presumed to have been affected by premature 
closing of the sagittal suture, and the one (No. 380) distorted owing to premature 
closing of the coronal suture, are included in Table II. They were omitted 
from Table I, as their anomalous conditions might give fallacious estimates of 
age at death, 

The figures show that for the majority of skulls the sagittal suture began to 
close before the other two, and that the coronal normally closed before the 
lambdoid. The order sagittal-ooronal-lambdoid is also found with the greatest 
frequency for the Kerma Egyptian series (Collett, 1933). The frequencies for 
the two series may be summarized in another way, as follows, only skulls for 
which the first suture to close can be observed being included: 




9 


Lachish 

Kerma 

Lachish 

Kerma 

Sagittal dosing before other two 
Coronal dosing before other two 
Lambdoid dosing before other two 

120 (81-1%) 
24(16-2%) 

4 (2-7%) 

75 (79-8%) 
18 (19-1%) 

1 (H%) 

28 (47-5%) 
30 (50-8%) 

1 d*7%) 

34 (68-0%) 

14 (28-0%) 

2 (4-0%) 

Totals 

148 

94 

fill 

60 


The values of the corresponding percentages for the male series arc quite 
insignificant, but differences which must be considered significant are found 
for the first two pairs of percentages in the case of the females. It should be 
pointed out that the question which suture shows a more advanced stage of 
closure than the other two may depend on the age of the individual, and if the 
rates at which the different sutures close are markedly different comparisons 
between the frequencies considered for different series may be misleading. 

For the English series, examined in the same way, the sagittal suture was 
found to he the first to close, almost invariably followed by the coronal and 
lambdoid, which appeared to close together. For a Negro series the coronal 
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was found to show a slight tendency to close before the sagittal. There are 
certainly racial differences with regard to the order in which the calvarial sutures 
close, and, as far as can be seen, the Laohish series agrees with Ancient Egyptian 
in this respect. 

It can be seen from Table II that there are only four male and one female 
skull showing the lambdoid suture closing before either of the others. An 
examination of these specimens shows that their sutures were definitely closing 
in an anomalous way in the majority of cases. Of the four males, Nos, lK4, 266 
and 330 show the oecipito-mastoid suture obliterated on both sides; No, 1K4 
also shows the parieto-mostoid suture obliterated on the left. No. 2(16 shows the 
temporal squama completely fused to the parietal on the right, and partly fused 
on the left; and No. 330 shows both parieto-mostoid sutures obliterated and the 
temporal squamae largely fused to the parietals. The other malt* skull with the 
lambdoid .suture closing first (No. 2(18) has all the sutures between the temporal 
hones on the one hand, and the occipital and parietal, on the other, open. The 
female specimen (No. 619) has the occipito-mastoid sutures obliterated, and 
the posterior parts of the temporal squamae fused to the parietals, 

There are five other male skulls, and two other female, showing obliteration 
of the parieto-mastoid suture, and/or partial or complete fusion of the temporal 
squamae to the parietal bones. In these cases the principal calvarial sutures 
are open or closing in a normal order, as far as can he seen, ami one (No, 367) 
have them obliterated. The specimens in question are: 

No. 90, $. Ocoipito-mastoid suture L obliterated, and posterior half of 
temporal squama L fused to parietal. 

No. 171, d. Occipito-mastoid suture L obliterated, and posterior half of 
temporal squama L fused to parietal. 

No. 230, $. Temporal squama R largely fused to parietal. 

No. 262, $, Occipito-mastoid suture L and parieto-miwtoid suture L obliter¬ 
ated. 

No. 367, $, Temporals completely fused to occipital and parietal bones, 

No. 383, ?. Ocoipito-mastoid sutures, R and L, and parietu mastoid suture L 
obliterated. 

No. 478, $. Occipito-mastoid sutures closed, R and L, temporal squamae Fused 
to parietals, R and L. 

There is one juvenile specimen (No. 680) showing the right panelu-maatoid 
suture obliterated, but all others open. 

In all there are six male skulls showing the temporal squamae completely, 
or partially, fused to the parietal hones, three on both sides, three on the left 
only, and one on the right only; and there are two female skulls showing the 
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same condition on both aides. A count was made of the number of oases showing 
the occipitomastoid suture completely obliterated on one or both sides; the 
totals are: 


I' ' " ■ 

Right. 

hit 

Right 

Total no. of'adult 

1 

and left 

only 

only 

skulls examined 

■> 

t\ 

K 

I 

351 

9 

l _ __ 

fi 

2 

1 

272 


When the condition is unilateral, it appears to he shown more frequently on 
the left than on the right side, but the numbers are too small to warrant a 
generalization, 

Premature closing of the sagittal suture was noted in the case of eleven male 
and six female adult skulls, all coming from Tomb 120. These were not included 
in the series used for comparative purposes, but individual measurements for 
them are given in a separate section of the appended tables. In the case of six 
male and four female specimens the sagittal suture is completely obliterated, 
and the coronal and lambdoid completely open. In the ease of the remaining 
five male and two female specimens the sagittal is completely obliterated and 
one or both of the other two sutures are closing or closed. Few of these skulls 
show any clear sign of distortion owing to the abnormal closure of the sagittal 
suture, the only exceptions being No. 304 (male) and Nos. 067 and 072 (female), 
these having the lowest cephalic indices. The three can safely he called scapho¬ 
cephalies, but the forms of the others are not exceptional, and hence it may be 
presumed that their sagittal sutures did not become synostosed until growth 
of the brain-box was nearly completed. The cephalic indices of all the specimens 
showing premature obliteration of the sagittal suture compared with those of 
the normal series (Tombs 107,10H, 110 and 120 combined) are as under: 


. 


6O-05- 

05415 

664)5- 

70415 

704)5- 

754)5 

764)5- 

804)5 

804)5- 

8605 

86-06- 

1)04)5 

Totab 

,S 

Normal serif* 

1 

20 

160 

112 

11 

0 

310 


Premature closing of 
sagittal nature 

1 * 

1 

7 

2 

0 

0 

11 

9 

Normal series 

0 

H 

112 

117 

15 

1 

252 


Premature closing of 
sagittal suture 

L _ 

1* 

H 

3 

1 

0 

0 

6 


* Accepted ns acaphoeophalie, 


There are no brachycephalic skulls among those in the special series, but the 
indices for the majority of them are clearly not abnormal. The fact that all the 
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adult specimen^ showing premature dosing of the sagittal suture came from 
the same tomb (No. 120) is suggestive. It may well be that the condition is of 
genetical origin, and that the affected individuals were related. This explanation 
would account for the high proportion of specimens exhibiting the anomaly. 

One other female skull (No. 577) has the posterior half of the sagittal suture 
completely obliterated, but the anterior half, and the coronal and lambdoid 
sutures, completely open, The specimen also shows a distinct post-coronal 
depression, though this is not apparent in the median sagittal plane. 

There is only one other skull (not included in the normal series) which 
apparently shows deformation resulting from premature closing of a suture. 
This is No. 380 (Plate VII f): its coronal suture is almost obliterated, while the 
sagittal and lambdoid are completely open. This is probably a case of oxy¬ 
cephaly, and, though the deformation is not unlike that which was clearly 
produced artificially in the case of certain specimens, yet all those showing 
artificial deformation most clearly have the coronal suture completely open. 
No, 380 has a height-length (100 II'/L) index of 83*3, whereas the highest for 
a normal adult male skull of the series is 80-8. 

No. 59 (male) shows the superior half of the coronal suture on the right side 
obliterated, but other parts of this suture open, while the sagittal is beginning 
to close and the lambdoid is open. It does not appear to he deformed, the height- 
length index of 75*1 being rather high but by no means extreme. No. 584 
(female) shows the right side of the coronal suture closed, while the left aide 
and the sagittal and lambdoid sutures are open. 

The oomplete metopic suture was found for twenty-six of the 341 male 
adult skulls included in Table I, and slight traces of it were observed in a few 
other oases: it was also found for twenty-two of the. corresponding total of 287 
female specimens. There are nineteen of the male, and twenty-one of the female 
affected individuals showing the metopic and sagittal sutures either both open, 
or closing together. In the remaining female, and six of the remaining male 
specimens, either one or the other suture is closing first, but the difference 
between the state of closure of the two is only slight. The greatest difference in 
this respect is shown by the last male specimen, which has the sagittal suture 
closed and the metopic open. The conclusion derived from earlier material that; 
the metopic suture normally closes about the same time as the sagittal, if it 
persists to an adult stage, is thus confirmed. 

The male percentage for the condition is 7*6, and the female 8*2, These 
values are higher than those for the Kerma Egyptian (4*5 male, and 0*2 female), 
but rather lower than the percentages usually found for European series. Of the 
sixty-one immature skulls from Lachish, there are eleven (18*0%) showing the 
complete metopic suture, an appreciably higher proportion. Of the eleven male 
•adult skulls not included in the normal series on account of premature closing 
of the sagittal suture, one is metopic, and of the six female in the same group 
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two show the condition. Of the seven male skulls which show patent artificial 
deformation, or a suspicion of this condition, two are metopic, and of the two 
female, one is metopic. The two specimens which are most clearly deformed 
(Nos. 381 male, and 673 female) are both metopic, which is curious. 

A few other sutural anomalies, besides those treated under supernumerary 
bones below, were noted. Among 267 female skulls, twelve were found with traces 
of the suture between the ex- and supra-occipital bones, but no trace was found 
in any male specimen. Only one example of a complete suture across a malar 
bone was found (No. 154 male), the right malar bone being divided and the 
left normal. No. 216 (male) shows incomplete fusion of the basi-occipital and 
basi-sphenoid, although it is almost certainly adult. No. 491 (female) shows 
the suture between the frontal and sphenoid bones obliterated on both sides, 
although the coronal suture is open. Several examples of fused nasal bones are 
noted in the remarks. 

The region of the pterion and supernumerary bones, The region of the 
pterion was examined on all the skulls, and the totals for which the sutures 
there are visible are given in the table below. Epipteric bones are recorded in 
the remarks on individual specimens, and, as usual, they are found to be diver¬ 
sified in number and size. The numbers of specimens showing contact between 
the temporal and frontal bones are given in the following table, the right and 
left sides being considered separately: 



<3 

~ 

if 


Immature 


It 

L 

It 

L 

R 

L 

Total no. of skulls with sutures at pterion 
visible on the side in question* 

257 

258 

138 

138 

01 

61 

No. of skulls with fronto-temporal articulation 

2 

3 

3 

3 

1 

2 

No. of skulls with pterion in K 



1 

2 

** ' 



* For the normal series (all tombs), oxeluding the posthumously and artificially deformed 
skulls, those showing premature closing of the sagittal or coronal suture, and No. 382. Among all 
those specimens excluded there is one ease of contact between the temporal and frontal bones. 


Of the four male skulls with fronto-temporal articulation, one shows the 
condition on the right side only, two show it on the left side only, and the last 
shows it on both sides. Of the five female skulls, two show the condition on the 
right side only, two on the left side only, and for the last it is bilateral. Of the 
two juvenile specimens, one has fronto-temporal articulation on the left side 
only, and for the other the condition is bilateral. Of the two female specimens 
with the pterion in K, one case is on the left side only, and for the other the 
condition is bilateral. The low frequency of cases showing contact between the 
temporal and frontal bones is comparable with those recorded for European 
Biometrika xxxi 8 
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series One of the male skulls suspected to be artificially deformed .shows fronto¬ 
temporal articulation on both sides, but all the others have normal pteria. 

Examples of one or more wormian bones between the temporal squamae 
and parietals were observed with a frequency which appears to be unusually 
high. There are ten eases of the condition exhibited unilaterally, and five 
bilaterally, among the male skulls; four unilaterally and four bilaterally for 
the female; and one unilaterally and one bilaterally among flu; immature, 
specimens, 

The normal series of 341 male skulls (all tombs) includes, the following eases 
of complete or partial division of the occipital bone: os spatial (preintorpariefai) 
2, interparietal bones 8 (only os pentagonals and os triamjulure R separate 2; 
only as pent, separate 2; only, os iri. It separate 2; only m in. L separate 1; two 
large bones meeting below lambda 1—No. (if), Plate XlVit), (races of hori¬ 
zontal suture of interparietal bones near aateria 11. The normal series of 267 
female skulls includes the following eases; os epactul 3, interparietal bones 2 
(only os tri. separate 1; only os pent, separate 1), traces of horizontal suture, 
of interparietal bones 9. The series of (11 immature specimens include the 
following cases: os epaclal 1, only os pent, separate 1. All the skulls not included 
in the normal series, on account of deformation or for other reasons, have 
normal occipital bones. The percentages of the occurrence of true interparietal 
bones, of one form or another, are 2-3 for the male adults, and !)•? for < he female 
adults. These values are close to those given for the Kerin a Egyptian aeries 
(Collett, 1933, p. 265), viz. 3-6 and 0-9%, respectively. 

One male and one female skull with an ossicle of bregma were noted in the 
total series. Ossicles of lambda of varying sizes were far more frequent. In 
general the sutures appeared to be moderately complicated and wormian hones 
in the lambdoid suture are by no means rare. A few cases of supernumerary 
bones in the sagittal and coronal sutures were noted, but these are all small, 
with one exception. This is a male specimen (No, 299), the sagittal suture being 
closed and the coronal closing. There is a large supernumerary bone in flu* right 
side of the coronal suture above the pterion. It is roughly triangular in shape 
(see Plate XIVa), with its maximum length over 56 mm. the inferior margin 
being broken—-and its maximum breadth 24 mm. The suture hounding this line 
anteriorly is symmetrically disposed with regard to the left half oil he coronal 
suture, which is normal, so the additional element appears to occupy part of 
the area normally occupied by the right parietal bone. 

Other anatomical anomalies. The rarest anatomical anomaly noted in the 
whole series is a case of complete absence of the right auricular passage. This 
specimen (No. 324) is male and Plate XV a shows the condition of the affected 
region. There is no opening in the bone taking the place of the right auditory 
meatus. The corresponding region on the left side is normally formed. Apart 
rom asymmetry in general form—the auricular region of the temporal and 
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adjoining parts of the base of the skull being more protruding on the left than 
on the right—there are no clear differences between the two sides of the base 
of the skull. Hrdlieka (1932-3) has discussed similar cases of complete congenital 
absence of the external auditory meatus and tympanic bone, with reference to 
seven American specimens, all of which axe affected on the right side only. 

An orifice in the right temporal squama of a male skull (No, 206) was noted. 
This is the end of a canal leading through the hone to the interior of the brain 
cavity. Another male specimen (No. 301) exhibits complete blockage of the left 
jugular foramen (see Plate XV :b), the right foramen being normal. 

Wounds. Small healed wounds were found on several of the skulls, and they 
are noted in the remarks given in the appended tables of individual measurements. 
One male skull (No. 47) has a healed wound on the left malar bone, but all other 
injuries noted are on the cranial vault. The most severe examples of completely 
healed injury are on three male—No. 5 (frontal bone, Plate XVIII a), No. 190 
(frontal bone), No. 301 (left parietal)—and four female—No. 419 (frontal, 
Plate XVIlie), No. 404 (frontal), No. 544 (frontal), and No. 670 (left parietal) 
—skulls. Only one specimen (No. 108, male, Plate IIIB), shows wounds which 
were probably inflicted not long before death. The outer table was completely 
removed in a region on the left, and another on the right, parietal: the edges of 
the affected area apparently show signs of healing, but the diploe is still exposed. 
There are long cracks in the vault of the same skull, but these are probably due 
to post-mortem injury. 

JHmised and other conditions. I am indebted to Dr L. W. Proger and Dr 
A. M. El Batrawi for commenting on the following and some of the exceptional 
specimens previously described. 

A female cranium (No. 602, Plate XVII) has what appears to be a diseased 
area extending over the greater part of the right side of the frontal bone. All 
the bones of the vault are softened and in a bad condition, with numerous small 
cracks, and the area in question is raised above the general level of the outer 
table. The roof of the right orbit is also affected and the endocranial surface of 
the right side of the frontal bone is slightly roughened. The inflammatory 
condition may possibly he due to osteomyelitis. Another male (No. I, Plato 
XVIII n) 1ms a roughened area on the parietal above the mid-point of the right 
side of the lambdoid suture. There seems to be no doubt that this is a patho¬ 
logical, and not a traumatic, lesion. 

A cranium, presumed to be male (No. 382, Plato XVI) is peculiar on account 
of the “swollen" appearance of its brain-box. This suggests hydrocephaly, but 
the hones are not exceptionally thin. The basi-oceipital is unusually short and 
broad, so the condition may bo due to achondroplasia. 

Artificially deformed skulls. There are eight of the Lachish skulls, all from 
Tomb 120, which are dearly artificially deformed, or which suggest this con¬ 
dition. The most marked cases are Nos. 381 (male) and 673 (female), both of 
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which happen to be metopic, and they are both young adult (hoc Plate VI), 
They are typical examples of fronto-occipital deformation, the two bones being 
flattened and a post-coronal depression being apparent in cadi case. Remarks 
on these two, and on the other six specimens, are in the appended tables of 
individual measurements, Of the latter, two male specimens (Nos. 376 and 378) 
show a clear suggestion of both frontal and occipital flattening, while the former 
shows slight post-coronal depression, but the latter does not. Nos. 377, 370, 
380 (male) and 451 (female), show some degree of frontal flattening only, but 
the occipital bones appear to be perfectly normal: of these No. 370 is the only 
one showing any suggestion of a post-coronal depression. No. 375 (male) shows 
some degree of both frontal and occipital flattening hut no post-coronal 
depression. Photographs of all these specimens, except No. 451, are reproduced 
in Plate VII. It is not possible to assert definitely that any of f hem were inten¬ 
tionally deformed in childhood. No. 380 has the coronal suture nearly obliter¬ 
ated and the sagittal and lambdoid open, and it is possible that its peculiar 
form is due solely to premature closing of the anterior suture. Artificial defor¬ 
mation was extremely rare in ancient Egypt, if it was ever practised there at all, 
There are records of affected specimens of earlier date than the Laeliwh from 
other parts of Western Asia, Crete, Cyprus, and some countries of Eastern 
Europe (see Dingwall, 1931). 

Trepanned skulls. The three trepanned skulls have been described by Dr 
T. Wilson Parry (1936), and new photographs of them arc given in Plates IV 
and V. He says that they are the first specimens exhibiting evidence, of this 
surgical operation to have been found in Asia, and no others have been dis¬ 
covered since in the continent. On two of the skulls (No, 114, an ageing male, 
and No. 115, a young adult male) a quadrilateral of bone has been removed by 
sawing, and it is said that there is no evidence that the primitive operation whs 
performed in the same way in any part of the world except Peru. The third skull 
(No. 340, a young adult male) shows the results of an operation of a different 
type. It is suggested that the individual had a depressed fracture, and that 
following this a piece of bone, which had become partly free m a result of the 
accident, was separated by sawing and removed. He survived long enough to 
enable the edges of the cavity to become completely healed, while the ot her 
two men must have died shortly after the operation. There appears to be no 
recorded case of a trepanned skull from ancient Egypt, Dr B&lrawi (1935, 
Plate XV) has given a photograph of a young adult female specimen of Merortie 
age from Lower Nubia with a circular opening on the right side of the frontal 
hone, supposed due to a trepan. 
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4. Remarks on the condition and anomalies op 

THE JAWS OP THE LaCHISII SKHLLS 

Considerably fewer than half of the skulls in the Laohish series have the 
upper dental arch complete or nearly complete. Table III gives statistics re¬ 
garding the loss of teeth before death for the adult specimens having complete 
upper jaws, and for the smaller number of adult mandibles with the dental arch 
complete. The percentages of cases with no teeth lost before death are high, but 
it must be remembered that the age constitution, judging from the state of the 
sutures, indicates a younger group, on the average, than that expected in a 
cemetery population, The frequencies of adult upper jaws with one or both 
third molars absent are not unusual, but it is customary to find the female 
percentage greater than the male. The samples are too small, however, to give 
a reliable sexual comparison in this respect. Remarks on the condition and 
anomalies of the upper jaws, and of the few mandibles associated with crania, 
are given in the appended tables of individual measurements. 


TABLE III 

The frequencies of different conditions of the teeth in 
adult jaws having complete dental arcades 



Upper jaw 

Lower jaw 



3 


(i) All teeth including third molars present at (loath 

01 

70 

13 

13 

(ii) Third molars apparently absent and no teeth lost 

6 

8 

1 

4 

before death 





(iii) One third molar apparently absent and no teeth 

5 

0 

0 

2 

lost before death 





(iv) Third molars erupted, or believed erupted, ami 

05 

20 

12 

12 

one or more teeth lost before death 





(v) Third molars apparently absent, and one or more 

4 

2 

3 

0 

teeth lost before death 





(vi) One third molar only apparently absent, and one 

1 

1 

1 

1 

or more teeth lost before death 





Total no. of complete arches 

132 

111) 

30 

32 

Total no. with no teeth lost before death: (i), (ii) 

72 

78 

14 

19 

and (iii) 

(54‘5%) 

(70-9%) 

(46-7%) 

(594%) 

Total no. with one or both third molars believed 
unerupted: (ii), (iii), (v) and (vi) 

16 

(12-1%) 

(10-0%) 

r> 

(16*7%) 

(21*9%) 
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Any dental anomalies of special interest were examined by Mr (', Bmvdler 
Henry, M.R.C.S., and he has kindly allowed me to incorporate his notes in the. 
following descriptions. The skiagrams reproduced in Plates XIX, XXII and 
XXIII were provided by him. 

Deflected canines. There are four adult upper jaws with a canine on one 
side grossly deflected and not completely erupted. 

No. Ill, male. The unerupted right canine is misplaced and buried obliquely, 
so that the tip of the crown is situated in the palatal alveolus, between the socket s 
of the first and second incisors, and the apex of the root is seen (uncovered by 
bone) in the external surface of the maxilla over the apex of the socket of the 
first premolar. The third molars were congenitally absent, and all the other 
teeth were present and in good condition. 

No. 142, male. The unerupted right canine is placed with the tip of its crown 
close to the anterior palatine fossa, and its root directed superiorly and pos¬ 
teriorly on the palatal side of the roots of the two premolars. It is possible that, 
the left third molar was either abnormally Hmall or else congenitally absent. 
The region of the right third molar is defective. 

No. 401, female: Plate XXI n. The anterior part of the dent a 1 arch is damaged, 
and it is not possible to judge how many teeth were present. The unorupted 
left canine is placed with the tip of its crown in the anterior palatine fossa, and 
its root directed externally, posteriorly, and superiorly, towards the malar 
process of the maxilla, and placed above the apices of the premolars. The right 
third molar is normal, but it is probable that the left third molar was reduced 
in size. 

No. 659, female. The left side of the palate and the part of the right side 
anterior to the canine are missing. The unerupted canine is placed with the tip 
of its crown near the normal position of the alveolus of the lateral incisor, and 
its root directed externally, posteriorly, and superiorly towards the malar 
process of the maxilla. The right third molar was probably erupted and lost 
before death. 

Supernumerary denticles , There are two crania and one mandible exhibiting 
this anomaly. 

No. 706, immature. The dental arch is symmetrical, with the left third molar 
erupting and the right apparently congenitally absent. There is a supernumerary 
denticle buried in the palate behind the right central incisor (see Plato X XI v). 
It is lying obliquely, with its crown in the anterior palatine fossa, and its tip 
impinging against the socket of the left central incisor. The condition of the 
residual root of the second right incisor suggests that this tooth had been broken 
during life. 

No. 383, female, All the teeth appear to have been present and in good 
condition at death. There is a small supernumerary denticle placed unusually 
far hack in the middle of the palate, to the left side of the suture (see Plate 



D. L. Risdon 119 

XXII c, d). The skiagram shows that the apex of the crown is directed 
posteriorly. 

No. 437, female. In the mandible all teeth were present at death, and there 
are five dental rudiments embedded in the outer alveolar margin. The positions 
of these can be seen from Plate XX. In the upper jaw all teeth were present at 
death, except the left third molar which had been lost. There are diastemae 
between the central incisors and between the lateral incisors and canines on 
both sides. There was post-normal oeolusion of the jaws, and the teeth were 
markedly and irregularly worn. 

Diastemae. There is one cranium showing diastemae between teeth, in 
addition to No. 437 described above. 

No. 132, male. The third molars appear to have been congenitally absent, 
and all other teeth were present at death in the upper jaw. There is.a diastema 
between the canine and first premolar on both sides (see Plate XXIb), 

Misplaced and missing teeth, and retained milk teeth. In addition to the four 
examples of deflected canines described above, there are four crania and two 
mandibles falling in this category. 

No. 72, male. The dental arch appears to be sufficiently roomy for the normal 
dentition, but crowding and some irregularity are present, due to the abnormal 
persistence of the right milk canine, and to the rotation of the left second 
premolar (see Plate XXIII c, i>). The right third molar appears to have been 
congenitally absent, and the left is unerupted with the occlusal surface of its 
crown facing posteriorly and slightly laterally in the posterior wall of the 
antrum, and in the developmental position of the tooth. The deciduous canine is 
a well-developed tooth retained in its functional position. Limitation of space 
has caused the permanent right canine to bo slightly rotated. The left second 
premolar is rotated so that its external, or buccal, cusp is antero-internal, and 
its internal, or palatal, cusp is postero-external. 

No. 445, female. No teeth had been lost before death, but there are only 
sockets for three in the right side anterior to the second premolar (see Plate 
XXI a). It is probable that the lateral incisor was missing. The antero-inferior 
part of the inter-maxillary region is deflected to the right, and the anterior 
palatine foramen appears to be completely absent. 

No. 40(1, female, The upper left third molar is below the alveolar margin 
and apparently impacted (see Plato XXIIh). 

No. 500, female. The dental arch is well formed, and all teeth except three 
molars are normally erupted. The left third molar was erupting at the time of 
death. The second and third right molars exhibit an unusual form of delayed 
eruption (see Plate XXII a, b). Neither tooth had emerged from the gum, 
although their direction and root formation appear to he normal, judging from 
the skiagram. The second molar is not impacted against the first, which might 
have accounted for the delayed eruption, but in fact they are clearly separated. 
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No. 1039, female. The left third molar of this mandible appears to be 
slightly ‘‘ over-erupted ”, and it has a backward tilt. In the region of the second 
and third right molars, which are missing, there is a deep pathological excava¬ 
tion, suggestive of an abscess cavity or cyst. The place ol the second premolar 
on the left side, which was congenitally absent, is taken by the retained deciduous 
second molar. The third molar on the left has an abnormally deep pit where the 
limbs of the cruciform fissures intersect: the aperture is 1 mm, in diameter. 
There are two smaller cavities in the occlusal surface of the second molar on 
the same side, 

No. 1068, immature. This mandible had lost no teeth before death. The 
canine and first premolar on the right are unerupted and completely buried, 
the premolar lying obliquely towards the canine so that their crowns are in 
contact (see Plate XXIII a, b). A skiagram shows that the same teeth, unerupted 
on the other side, are in a similar condition. 

Diseased conditions of the jaws. Only the more marked cases of diseases of 
the jaws were noted. Notes on a mandible (No. 1039), with a large abscess 
cavity or cyst, are given above, and there are three skulls and two mandibles 
exhibiting similar conditions. 

No. 14, male. All upper teeth were present at death except the lateral 
incisor and canine on the right side. The shrunken and eburmited condition of 
the alveolus suggests that these two teeth might have been lost before death 
through traumatic injury. 

No. 467, female. Nearly all teeth had been lost before death, and there is 
a large cyst in the anterior part of the palate, penetrating to the nasal aperture 
(see Plate XXIII f). 

No, 469, female. All the upper molars had been lost before death, and there 
is a large abscess cavity or cyst in the molar region on the right side (see Plate 
XlXIIlE). 

No. 1017, male. No teeth had been lost from the right side of this mandible 
before death, the left side being defective. There is a large carious cavity in the 
posterior half of the first molar, which evidently involved the dental pulp. In 
consequence of this, an abscess formed which discharged through a sinus in the 
external alveolar plate, over the apex of the posterior root of the tooth. 

No. 1061, female. This mandible had lost no teeth before death, the third 
molars being congenitally absent. The bone of the external surface immediately 
above the left mental foramen is diseased, 

Apparent adventitious filling ofatooth. A female cranium (No. 618) is remark¬ 
able on account of the fact that a piece of metal was found firmly embedded in, 
and level with, the occlusal surface of the second right molar near its centre. 
The fact that the surface of the metal that could be seen was flat, and that its 
edges conformed to the surface of the tooth, shows conclusively that it must 
have been where it was found during the life of the individual. Its appearance 
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was precisely similar to that of an artificial stopping. Plate XIX reproduces 
photographs of the whole palate and of the right molar region, before and after 
the removal of the metal, and a skiagram showing its depth, The filling was 
removed and its dimensions were found to be 1 x 3 mm. (exposed surface) by 
1 mm. (height). The cavity in the tooth, which shows no sign of disease, is almost 
circular in form, and its position, where the fissures between the cusps join, is 
one in which a small pit is sometimes found. The description of mandible 
No. 1039, on p, 120 above, refers to pits of this kind in the occlusal surfaces of 
a second and third molar. There is a small pit in the surface of the second right 
molar of skull No. 518. These circumstances suggest that the person accidentally 
bit the piece of metal, which became lodged in the natural pit of the second 
molar and worn down as teeth are normally. Such an explanation appears 
much more plausible than an alternative one which might attribute the filling 
to surgical interference. 

5. The statistical nature op the material 

The statistical nature of the Lachish series is discussed in this section, 
topics considered being the question whether the male and female adult and 
immature samples can ho supposed to represent the same population or not, 
sexual comparisons of average types and variabilities, and allied matters. As 
the series came from four tombs, which were adjoining, it is advisable to ask 
first whether there are any significant differences between the series from each, 
and whether it is legitimate to pool all the material for statistical purposes. In 
making all metrical comparisons, the specimens patently, or apparently, 
artificially deformed, those showing premature closing of the sagittal or coronal 
suture, and the one of unusual (? hydrocephalic) form were excluded. The 
numbers in these exceptional groups are in the table below, and measurements 
of the specimens, which are all adult, are given separately in the appended 
tables of individual measurements. 


The numbers of anomalous crania 


Tomb 


<f 

9 

120 

Artificially deformed 

6 

1 


Premature closing of sagittal suture 

11 

6 


Premature closing of coronal suture 

1 

— 


Hydrocephalic? 

1 

*’— 


When seen all together, the series from each tomb showed a variety of 
“types”, but no specimens which clearly stood out from the others on account 
of their characters were noted, with the exception of two males, viz. No. 166, 
Tomb 120, and No. 179, Tomb 116. Noma facialis views of these two are 
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given in Plate III c, n, and it will lie seen that they are ot a very .similar type, 
which is distinctly different from that of the male specimen (Plate X, right), 
which was selected because none of its measurements arc at all peculiar (see 
p. 163 below). The cephalic index of No. 15(1 (81-7) is high, but not extreme, 
and that of No. 179 (784) is less exceptional. In spite of these two, it was 
thought best not to exclude any specimens from the aeries on account at their 
appearance. 

In making male comparisons between tombs, it was necessary to pool data 
from Tombs 108 and 110, as the series from them are too small to stand alone. 
Male and female means are given in Table IV for various series from single tombs, 
or groups of tombs, in the case of those coefficient of racial likeness characters 
for which the pooled means for Tombs 107, 108 and 110 are based on more 
than ten skulls. It will be seen that the series from Tomb 120 is the only one 
long enough to provide adequate statistical constants. The others are just lung 


TABLE IV 

Mean measurements of series of Lachish crania from 
different tombs 




Male aeries from tombs 


Female mim front tomim 







1 


120 

107 

108, 110 

107,108,116 

120 

107,108, HO: 

i 

L 

B 

B' 

H' 

8 

w 

V 

fml 

jmb 

LB 

G'H 

NB 

NH,L 

0,L 

OJj 

Of 

0\ 

100 J3/L 

100 H'jL 
100 B/H' 
Oc.L 

100 fmb/fml 
100 NBINH 
100 OJO v L 
100 

AL 

184*3 (266) 
137*1 (276) 
96*4 (263) 
133*8 (226) 
376*9 (221) 
308-7 (265) 
618*1 (266) 
37-0 (211) 
30*6 (213) 
100*6 (206) 
70*1 (76) 
26*2 (96) 
61*3 (103) 
41*4(117) 
32*8(120) 
47*0 (72) 
40*4 (43) 
74*5 (269) 

72*7 (216) 
102*6 (219) 
59*6 (238) 

82*9 (196) 
49*6 (88) 
79*3 (113) 
85*7 (33) 
64°*0 (70) 
73°>8 (70) 

185*0 (32) 
136*2 (28) 
96-6 (32) 
133*2 (24) 
372*7 (20) 
307*7 (29) 
519*0 (27) 
30*4 (21) 
29*7 (20) 
101*3 (21) 
68*8 (14) 
25*0 (18) 
51*7 (20) 
41*8 (21) 
33*6 (10) 
46*3 (10) 
39*8 (8) 
73*6 (28) 
72*6 (24) 
103*2 (20) 
60*0 (26) 
81*8 (16) 
48*7 (17) 
80*4 (18) 
86*9 (6) 
64°*9 (12) 
74°*0 (12) 

185*6 (24) 
134*6 (23) 
96*1 (24) 
134*3(18) 
372*6 (14) 
308*3 (22) 
517*1 (22) 
38*0 (15) 
30*4(11) 
101*1 (17) 
72*6 (8) 
24*9 (9) 
62*1 (13) 
42*0(10) 
33*3(13) 
46*4(12) 
39*6 (0) 
72-6 (23) 
72*0(18) 
100*8 (17) 
69*4 (15) 
79*7 (11) 
47*9 (9) 
78*3 (10) 
86*0 (6) 
62°*2 (7) 
74°*6 (7) 

185*2 (56) 
135*6(51) 
95*9 (50) 
133*7 (42) 
372*6 (34) 
308*0 (51) 
518*2 (49) 
37*1 (36) 
30*0(31) 
101*2 (38) 
70*2 (22) 
25*0 (27) 
51*8 (33) 
41*9 (31) 
33*5 (32) 
45*7 (28) 
39-7 (14) 
73*1 (61) 
72*3 (42) 
102*1 (37) 
69*8 (40) 
80*9 (27) 
48*4 (26) 
79*7 (28) 
86*5(11) 
63°*9 (19) 
74°*2 (19) 

176*6 (2(H)) 
133*1 (20*1) 
92*2 (liKI) 
128*3 (106) 
362*9 (104) 
297*6 (183) 
500*1 (170) 
35*8(151) 

29*0 (154) 

1)0*0 (169) 
07*2 (69) 
24*0 (02) 
49*1 (78) 
40*0 (78) 
33*1 (78) 
*15*0 (03) 
39*2 (48) 
75*5 (190) 
72*7 (102) 
103*7 (102) 
60*1 (100) 
81*5 (13*4) 
50*3 (59) 
81*7 (69) 
87*0 (39) 
<H°*4 (54) 
73°*9 (54) 

177*0 (59) ! 
133-H (57) 1 
92*1 (55) | 
128*0 (47) 
303*5 (48) 
298-H (51) 
501*4 (44) 
35*9 (42) 
28*B (38) 
95*7 (47) 
00*4 (28) 
24*3 (20) 
48*0 (38) 
40*4 (33) 
33*4 (34) 
44-4 (27) 
39*4 (20) 
76*4 (50) 
72*8 (47) 
104*3 (44) 
59*7 (49) 
«()■« (33) 
49*7 (25) 
82*8 (31) 
80*5(16) 
64 -7 (21) 
73°*3 (21) 
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enough, however, to make comparison of interest, The following crude coeffi¬ 
cients of racial likeness,* using standard deviations for the Tomb 120 series 
alone, were found: 

Male, Tomb 120 (177-7f) and Tomb 107 (21-4) ... -0-45 ±0-19 (26$). 

Male, Tomb 120 (202-1) and Tombs 108, 116 (16-0) ... 0-44 + 0-21 (20). 

Male, Tomb 107 (23-9) and Tombs 108, 116 (16-6) ... -0-18 + 0-21 (20). 

Male, Tomb 120 (167-3) and Tombs 107, 108, 116 (34-6) ... 0-41 + 0-18 (27). 
Female, Tomb 120 (116-3) and Tombs 107,108,116 (36-8) ... -0-20 + 0-18 (29). 

It is known from experience that no reliance can be based on comparisons 
of measurements for short series of skulls. Two series, each being made up by 
fewer than twenty specimens, and actually representing two distinct races, are 
quite likely to indicate an insignificant difference, and series of fifty or more are 
usually required before any reliance can be placed on comparisons such as those 
given by coefficients of racial likeness. All of the live values above may be con¬ 
sidered to differ insignificantly from zero, so the evidence, as far as it goes, 
indicates that the series from all the tombs may have been random samples 
from the same population. Larger samples would be required to justify this 
hypothesis in an adequate way, but for practical purposes there can be no 
objection to pooling all the material, on the supposition that it represents a 
single population. This conclusion is in conformity with the archaeological 
evidence. 

It appeared worth while to compare the variabilities of two sub-groups of 
the total material. All the standard deviations given in Table V are for forty 
or more skulls. Two of the differences for corresponding constants might be 
considered just significant if considered by themselves (<$ L, d/r.n. A = 3-4, and 
d S it 3-2, the Tomb 120 constant being the greater in both these cases), but the 
conclusion for all characters must be that the two groups are almost identical 
in variability. 


* With the usual notation, the form of the crude coefficient used is: 


M 


V«.+»»- 


[ m , — m ,.) 8 

c 


) - 1±(>67449 V /_|=^ £(a)—l±0-67449 J 


2 

M' 


If it, is the mean number of sknllH available for the characters used in the case of the first series, 
and ?7,. is the auno for the second series, then the “reduced” coefficient is defined to be: 


60 


X - Z'(a)- ll ±0-67449 . /-| x 50 X . 

(M J \ M n,n,< 


f This figure is the average number of skulls for the Beries ( n }, in the ease of the comparison in 
question, no means based on fewer than ten specimens being used, 

f The number in brackets following a coefficient is the number of characters on which it is based. 
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TABLE V 

Standard deviations for series from Tombs 107, 108 
and 116 combined, and from Tomb 120 




Male 

Female 

Gluufl'OLoi 

Tomb 120 

Tombs 107,108,110 

Tomb 120 

Tombs 107,108,110 j 

L 

0-03+0-18* 

4-82 + 0-31 

5-03+0-17 

5*19 ±0-32 

B 

4-90+ 0-14 

6-65 + 0-38 

4-0K + U-10 

3-95 MI-25 

B' 

4-16+0-12 

4-19 + 0-27 

4-53 + 11-10 

3-89 + 0-25 

h: 

4-85+0-16 

6-73 + 0-42 

4-18+ 0-16 

4-32 + 0-3(1 

B t 

54)7 + 0-18 

6-84 + 0-40 

6-87 ±0-2(1 

5-13 + 0-32 

St 

7-44+0-21 

6-96 + 0-41 

7-18 + 0-25 

7-02 + 0-45 i 

$3 

7-04 ±0-22 

6-80 ±0-44 

6-68 ±0-24 

0-84+0-40 

w 

9-64 ±0-29 

9-78 + 0-66 

9-47 + 0-33 

8-54+0-57 

u 

13-86 + 0-41 

11-66 ±0-79 

12-54 ±0-40 

12-52 + 0-9U 

100 B/L 

3-02+ 0-09 

3-20 ±0-22 

3-01 +0-10 

2-91+0-lK 

100 H'lL 

2-88 ±0-09 

3-18 ±0-24 

3-02 ±11-10 

2-51 ±0-17 

OcJ, 

2-62+0-09 

2-36+0-18 

2-51 ±0-09 

2-30 + 0-10 


* The symbol + indicates probable errors throughout thin paper. They wore used instead of 
standard errors as probable errors have been given far more frequently than standard errorw in 
craniomotric studies. 


The data for material from all four tombs were pooled, ho that a single 
Lachish series, subdivided according to sex and age, is considered in all com¬ 
parisons made below. It may be asked, next, whether the total adult male and 
female series can be supposed to represent the same population or not. The 
constants for them are given in Table VI. 

, The corresponding mean measurements of shape—indices and angles -for 
the two sexes are obviously very close. The differences only exceed three titties 
their probable errors in the case of; 

100 B/L (d/r.E. A = 6-7), 100 BIW (4-5), 100 (fl-//')//, (All), Or.L (3.3), 

100 0 % (O x , L (6-6), 100 BBfSG (6*7), 100 fmb If ml (3-4), 100 SjC, ml (f>*3). 

In the case of the first six of these characters, it is normally found that the 
male and female means for long series show small differences of the same signs 
as those observed for the Lachish series. For the last character there is no good 
comparative material, but the longest series available, described by Woo {1937), 
shows a sex difference of the same sign. As far as can be told from these com¬ 
parisons, the male and female Lachish series represent precisely the same 
population. 



TABLE VI 

Constants for the male and female series of adult crania 
from LacMsh (all tombs combined) 


Standard deviations j Coefficients of variation 



* Determined from reconstruction formulae: boo Appendix I. 
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The few juvenile and more numerous adolescent skulls together, a total of 
sixty-one unsexed specimens, give the mean indices in the following table, 
where comparison is made with the male and female adult means. 

Immature j 

.- i 

715-2 (S3) | 

72 (5 (11) ; 

104-0 (HO) 1 
2-7 (.*)«) ; 

7(5-0 (Hr.) ! 

5*2-0 (22) ; 

83*8 (22) j 

.._ .... i 

The sequence, male adult—female adult---immature (unsexed) mean, is ex¬ 
pected in the case of the last three of these indices. The first four give means 
for the immature series which dilTer quite insignificantly from the female adult 
means, and a close correspondence of this kind is expected. There is thus every 
reason to believe that the children and adults belonged to the same population. 

The sex ratios (male mean/female mean) for a few measurements of size are 
given in Table VII for the Laohish, three Egyptian, and four English series. 
All the means involved are based on eighty-six or more crania, 

TABLE VII 

Sex ratios for the Lacldsh and other scries* 

'Inglish, j English, 

White- Kpitnl* 
chapel ! fields 


1*1145 I 1 -115(1 

Ht'W 1 MW) 

1*041 

1*053 1-014 

1*0511 1*053 

1-0-lK 1 (152 

1*05(1 1*058 

I .. 

* The series are: Egyptian IS, OiV.eh, 26th-30fch dynasties (I)avin k IVawon, 1024); Kenim, 
12th~13th dynasties (Collett, 1033); benderah, (ith-12th dynasties (Tlumwon & Maelver, UHW); 
Earringdon St, seventeenth century (Hooke, 1920); Whileahapd, Hewntuenlli century {Mawlonell, 
1904); Spitalfields, Roman or medieval (Moraufc & Headley, 1031); Hytile, medieval (8Pa«ii!* r 
& Morant, 1932). 

t Eor It in place of H\ These very similar calvarial heights give almost identical sex ratios in 
the case of long series for which both arc given. 

The characters are arranged in the table in order of the sex ratios for the 
Laohish series. The three Egyptian series give very similar orders, that for the 


English, 

llythe 


104(5 
1-040 j 
MWH 1 
1041 
I -054 1 

1 03 H * 
1-053 
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Kerma being particularly like the Lachish. As is usually found, the pre¬ 
ponderance of the average male over the average female skull is clearly different 
for different measurements, the sex ratios being rather less for cranial diameters 
than for stature and the lengths of the long hones. It is curious that the orders 
in which the characters are arranged by the Lachish and Egyptian sex ratios 
are distinctly different from all those given by the constants for the four English 
series, while these last show a general, though not very close, correspondence 
inter .se. There appears to be a distinct difference between the sex differences of 
the Lachish and ancient Egyptian races, on the one hand, and the later English 
ones, on the other. 

Measures of variability for the total male and female adult series of crania 
from Lachish are given in Table VI. Comparison of the relative degrees of 
variation exhibited by the material for the two sexes may be considered first. 
For the thirty-three absolute measurements the male standard deviation exceeds 
the corresponding female value in twenty-four cases, and for the remaining 
nine the position is reversed. In one or two instances the excess of the male 
over the female constant is clearly significant, but where the female standard 
deviation exceeds the male the differences arc quite insignificant. In comparing 
parallel samples representing the same population it is customary to find a 
slight preponderance of male over female variation for size characters. A closer 
approach to equality is normally found for these if coefficients of variation arc 
used. 

The following comparisons are based on coefficients of variation for absolute 
(size) characters, and standard deviations for measurements of shape (indices 
and angles), the total number of characters being fifty. In twenty-four cases the 
male measure of variation exceeds the female, and for the other twenty-six the 
position is reversed. The only difference which exceeds three times its probable 
error is for LB (ri/l'.E. A - 3-2), but no importance whatever can be attached to 
a divergence of this order in fifty comparisons. The male and female series thus 
show a remarkably chose approach to equality in variation, and the agreement 
in this respect again suggests that a single population is represented. 

It may he asked next how the variability of this population compares with 
that of others which arc usually accepted as being racially homogeneous. It will 
be sufficient to make comparisons with the long series of 2((th--30th dynasty 
skulls from (.lizeh (Ravin & Pearson, 1924), as these have often been used for 
the purpose. There are thirty-three characters for which the constants required 
(coefficients of variation for absolute measurements and standard deviations for 
indices and angles) are available. In the case of the male series the Lachish 
value exceeds the Egyptian £ in fifteen cases, and the Egyptian £ exceeds the 
Lachish in the remaining eighteen. Differences exceeding three times their 
probable errors are only found in the case of three measurements of shape, viz. 
100 BjL (d/r.E. A- 4-0, Lachish cr the greater), ri£(4-l, Egyptian E greater), 



128 Cranial and other tinman Remains from Palestine 

and Oc.I. (8-3, Egyptian hi greater). In the ease of the female series the Lachish 
constant exceeds the Egyptian E in twenty-three eases, and the Egyptian K 
exceeds the Lachish in the remaining ten. The “significant” differences are 
for BL (3-1), B' (3-8), 100 BjL (4-2) and LB (4-0), the Lachish variability being 
the greater for these four, and Oc.I. (7-2), in which case the Egyptian Err 
exceeds the Lachish. It is clear that there is a close agreement between the 
variabilities of the two populations, and, in fact, it is not possible to say that one 
was more homogeneous than the other. No importance can he attached to any 
of the differences for single characters, except those for the occipital index, 
which is decidedly more variable for the Egyptian series in the case of both 
sexes. 

Comparison with data for other series shows that the Lachish standard 
deviations of the occipital index aro quite unexceptional, while those for the 
Egyptian E series are peculiarly large, in view of the fact that all other characters 
for it indicate a low order of variability, it may he noted that a few of the 
Lachish crania (excluded from the series) are clearly artificially deformed. If 
artificial deformation had been generally practised to a slight degree, (Ids would 
have been expected to increase the variability of the occipital index. .Such an 
effect is not observed, however, and it is very probable that; the exceptional cases 
noted are the only ones exhibiting artificial deformation. 

All the statistical evidence thus points to the fact that the male and female 
adult and immature Lachish series represent random samples from a single 
population, which showed the same order of variability as ancient Egyptian 
populations. It is known that these were slightly less variable than Western 
European populations in later times. 


6. The redationships of the Lachish and Ancient Kuyitian populations 

JUDGED FROM COMPARISONS OF MEAN MEASUREMENTS 

The Lachish is the first series of skulls of any period from Palestine which is 
of an adequate length, and for which adequate measurements arc available, 
A rough comparison of mean measurements for it showed that the type is very 
similar to that of certain ancient Egyptian series, and this was fully nulmtiu 
tiated by statistical comparisons. As far as can be judged from the appeuranu 
of the skulls, no clear distinction from dynastic Egyptian material is evident. 
Comparisons with Egyptian series only are dealt with in this section, and 
reference is made to other material from Palestine in § 9. 

Ear more skulls have been preserved and studied from an anthropological 
point of view from Egypt than from any other country in the world. The vast 
majority of these relate to Roman and earlier times, and no comparisons with 
the meagre post-Roman material from the country were made. Most of the 
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longer series of male Egyptian skulls representing populations of the period 
considered have been compared statistically by Morant (1925). A selection of 
the material with which he dealt was made, and certain series were excluded 
for various reasons, viz.: 

(i) Eouquet’s predynastic—the so-called Aeneolithic—series from Middle 
Egypt. The male means of size characters for this series are decidedly larger 
than those for any other predynastic series dealt with in Morant’s paper, or 
described since, while its mean indices are very close to those for some con¬ 
temporary series. It appears to be very probable that the distinction was due, 
either to a process of selection which favoured the larger specimens, or to 
inaccurate sexing. While the series is suspect it appears better not to use it for 
comparative purposes. 

(ii) The series measured by Broca and Chantre, for which the means are 
unsatisfactory as they are not recorded in fine enough units. 

(iii) Four other scries, each comprising fewer than thirty skulls. 

(iv) The El Kubanieh South series, measured by Toldt, which relates to a 
period covering Early and Middle Dynastic times, while all the other series 
relate to shorter intervals. 

The modern Abyssinian series measured by Sergi, and reduced by Morant, 
is included in our list. This accounts for eighteen of the series used. The three 
other ancient Egyptian series included are the 9th dynasty from Sedment 
(Woo, 1930); the Kerma (Collett, 1933)—from a Nubian site, although the 
population was unquestionably of Egyptian type—and the pooled early pre¬ 
dynastic Badari series (Morant, 1935), measured by Stoessiger (1927) and 
Derry. Including the Lachish, there is thus a total of twenty-two male series, 
made up by about 3000 skulls. The localities from which the collections were 
obtained are shown in the sketch-map, Pig. 2. 

Reduced coefficients of racial likeness between all pairs of the twenty-two 
series are given in Table VIII. Many of these were obtained from the crude 
coefficients given by Morant (1925), and others were copied from tables given 
by Woo (1930), and Collett (1933). In addition to these I was able to use a 
number of unpublished values calculated by Morant, and to complete the 
table I calculated all the coefficients with the Lachish series and several others 
which wore wanting. The n for a particular series in the table gives the average 
number of skulls on which the means are based, in the case of all the characters 
used in computing the coefficients which are available for the series. The na 
range from 29-9 to 855*4, and there axe only six, including the Lachish, greater 
than 100. Experience lias suggested that a series of fifty complete skulls of one 
sex, or a larger number of incomplete specimens providing an equivalent 
amount of evidence, is required in order to give reliable racial comparisons with 

Biomctrika xxxt 9 
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a particular population. Of the twenty-two series, there are eight which fail to 
satisfy this ideal requirement, but these may he supposed large enough to give 
fairly reliable comparisons, 

When possible coefficients of racial likeness between cranial series have been 
based on a standard set of thirty-one characters, nineteen of these being ab¬ 
solute measurements (chords and ares), seven indices, and three angles. .For the 
Egyptian material considered it was only possible to use the complete set in a 
few cases, owing to the fact that one or several of the measurements are not 
available for the majority of the series. In all cases as many as possible of the 
selected characters were used in computing the coefficients. The distribution of 
the numbers of characters on which the 231 coefficients could be based is: 


No. o{ characters 


24 

25 

26 

27 

28 

2Q 

30 

31 

No. of comparisons 

185 

2 

5 


5 

16 

14 

7 

12 


More than half of the coefficients are thus based on only fourteen characters, 
all these being comparisons of the ten series measured by Thomson and Maclver 
with one another, and with the remaining twelve series. The coefficients between 
these twelve are all based oil a number of characters which can be supposed 
adequate, hut it may he suggested that, comparisons between them and those 
for which only fourteen characters are involved are likely to be unreliable. The 
question of how far the generalized estimate of resemblance for fourteen 
characters is likely to diverge from that obtained from about twice as many was 
investigated by computing some of the more reliable coefficients for the in¬ 
complete set of fourteen characters available for Thomson and Maclver’s series. 
Comparisons of the corresponding reduced coefficients for (a) all of tire characters 
available, and (b) for the fourteen characters only, are made in Table IX, in 
the case of fifteen pairs of series selected at random. 

For seven of the fifteen comparisons the reduced coefficient for fourteen 
characters is greater than the corresponding valuo for the larger number, and 
markedly greater in four eases; for the remaining eight comparisons the position 
is reversed, but there arc no marked differences between the pairs. It appears 
that the generalized estimate of divergence derived from the fourteen characters 
can usually lie expected to give a fairly close approximation to that which would 
be obtained from about twice as many characters, but occasionally it will 
suggest a rather misleading conclusion. In the ease of the material considered, 
the fourteen characters show a tendency to indicate a wider separation of the 
types than that likely to he found if more evidence were available, and this 
might have been anticipated from the fact that they include the characters 
which normally show the most significant differences in the comparisons of 
Ancient Egyptian material, viz. B, 100 BjL, J and JVZ.. 


9-2 
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TABLE IX 

Comparisons of corresponding pairs of reduced coefficients of racial likeness [male 
series) for all coefficient characters available mul for a set of fourteen ehurarters 



Reduce*! etM-fiii-ieiitM 
of raeial likeness 

All 

characters 

— -- 

14 

characters 

Laehish (191-5*) with Thebes, 18th -2()th dytt. (54-1) 

2-13 4 0*23 (25) 

0-01 +O-30 

Naqadft A and Q (154-9) with Hi Kubanioh North (33-1) 

3-91+0-41 (28) 

0-11 + (1-58 

Abydos, 18th-19tlulyn. (31-5) with Thebes, IKt-h-2l)th dyn. (54-1) 

5-13-1 (M8(20) 

8-70 (-0-04 

Laehish (191-5) with Thebes, 18th~21sfc dyn. (1(17-9) 

5-27 Ml-11 (20) 

3-231:0-14 

Laehish (191-5) with (lizeh, 2(ith-3(lth dyn. (H(i9-5) 

5-77 +U-UII (29) 

5-37,1 ii-OH 

Laehish (101*5) with Abyssinian (92-9) 

9-89 ±0-211 (20) 

4-23 + 0-27 

Laehish (191-5) with Korina (107-11) 

7-80 + 0-13 (29) 

7-78+ o-Ul 

Abyssinian (02-0) with Ml Kubanioh North (33-1) 

8-08+0-44 (25) 

5-50 Mi-55) 

Sodment (37-5) with Thebes, 18th-2M dyn. (1(17-9) 

9-18 + 0-29(30) 

1 1-051 * 0-42 | 

Thebes, 18th--20th dyn. (54-1) with Abydos, 1st dyn. Royal (33-2) 

12-032.0 -14 (29) 

10-03 i O-02 j 

Abyssinian (02-0) with Abvdos, )Kt-h-19th dyn. (31-5) 

13-04 +0-10(25) 

111:37 + 0-01 

Sediment (37-5) with Abydos, 18th-19th dyn. (31-5) 

13-12 * IMSU(2K) 

13-97 Ml-73 

Naqada A and Q (04-9) with Uizoh, 20th-30th dyn. (809-5) 

14-35 *0-14 (30) 

19-03 i 0-21 

Laoiiish (191-5) with Ikdari ((SI-1) 

24-12 + 0-22 ( 29) 

21-07 • 0-27 

Badari (OH) with Abydos, 1st dyn. Royal (33-2) 

30-24+0-44 (31) 

45-33 < 0-59 


* Tho numbers in brackets following the tit-lea of the series give, the average numbers of skulls 
(ii's) in tho case of tho fourteen coefficient of racial likeness character available for all the series. 
In tho case of the comparisons for “all characters" tho it's ant changed, but they are invariably of 
tho samo order. Fuller particulars regarding tho series arc given in Table VIII. 

The point in question can be examined in an indirect way by comparing tint 
distributions of reduced coefficients in Table V.III based [a) on fourteen elm- 
racters, and (/>) on twenty-four to thirty-one characters. These are: 



In- 

signi¬ 

ficant’ 1 ' 

Signi¬ 

ficant 

and 

<3-5 

3-5-5 

6-13 

-- - 

13-20 

20 30 

30 40 

40 50 

Tutu)** 

14 olmractors 

9 

10 

18 

72 

27 

17 

5 

1 

)05 

24-31 characters 

0 

4 

(1 

31 

17 

5 

2 

1 

00 

All comparisons 

9 

20 

24 

103 

44 

22 

7 

2 

231 


* A coefficient is counted hero as insignifloant if it differs from zero by less than 3*5 times its 
probable error. 


Allowing for the differences in their sizes, these two distributions appear tc 
be very similar, and it is clear that the coefficients for fourteen characters onlj 
do not show any marked tendency to diverge from the others on the average. 
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It is curious that there are no insignificant coefficients based on twenty-four 
or more characters, but little importance need be attached to this fact. 

In computing all the reduced coefficients of racial likeness given in Table 
VIII, the Egyptian E standard deviations (Bavin & Pearson, 1924) were used. 
It is thus assumed that the variabilities of all the populations are so closely 
similar that any one of them can be adequately represented by the constants 
for the longest series of the group available. It has been shown above that the 
Laehish standard deviations are very similar to the corresponding values for 
the If series. The constants for three of the other series—viz, the Badari, Sed- 
ment, and Kerma—have been compared with the Egyptian E in the same way, 
and it was found that the variabilities indicated by them are also of almost 
precisely the same order. At the same time it is known that the Ancient Egyptian 
series show a clear tendency to be rather less variable than Western European 
series of later times. 

In the comparison of any two series the full coefficient of racial likeness 
involves both sots of standard deviations. The question of the extent to which the 
assumption that the population variabilities were precisely the same is likely to 
affect reduced coefficients was examined in an indirect way in the case of a few 
comparisons, For twenty-nine of the comparisons reduced coefficients happened 
to be available computed by using both (a) the Egyptian E as, and (b) the 
Farringdon St (seventeenth-century London, Hooke, 1926) rr’s. In every case the 
former value is greater than the latter, as would have been anticipated, and the 
ratio of the reduced coefficients found with Egyptian E ex’s to the corresponding 
values found with Farringdon St cr’s, range from M)5 to 1-79, while for 21 of the 
29 ratios, the range is between 1-3 and 1*G. The Farringdon St variablities are 
obviously inappropriate for use in comparisons of Ancient Egyptian material. 

For the ten cases giving the largest ratios the reduced coefficient was also 
computed using the Laehish cr’s, and the values for the three sets are given in 
Table X. The ratios of coefficients found with the Egyptian E a ’s to the 
corresponding values found with the Laehish a ’s range from 0-91 to 1-17. For 
seven of the ten comparisons the use of the Laehish <r’s gives the lower value, 
hut all the differences here are so small that they may be considered of no 
importance. 

The reduced coefficients of racial likeness which were used in this paper to 
provide a classification of a number of Ancient Egyptian and related populations 
are clearly not statistical estimates of an ideal kind. Owing to the nature of 
the material, certain devices have to bo used in practice which are likely to have 
an appreciable affect on the coefficients obtained. It has been shown above that 
there is good reason to believe that the use of a constant set of standard devia¬ 
tions does not distort the value to any appreciable extent. The use of different 
sets of the complete group of thirty-one characters appears to be of far more 
practical importance. More than half of the 231 coefficients computed are 
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actually based on fourteen characters only, the numbers tor the remainder 
r angin g from twenty-four to thirty-one. Comparisons with the shorter.'series might 
have been omitted, but this would have reduced the evidence to such an extent, 
that it appeared better to include them, while remembering their imperfection. 
There are other objections to the constants. It is known that the theoretical 
requirement that all the characters used should be uneorrolated with one another 

TABLE X 


Corresponding reduced coefficients of racial likeness for male, cranial mm computed 
by using (1) the Egyptian E, (2) the Lachish, and (3) the, Famtujdon Hi standard 
deviations 


■ 

No. 

With 

With 

With 


of elm* 

Egyptian K 

Lurliisli 

Earrmydiin 


meters 

rr*H 

if h 

•St. m's 

Abysslniana with Abydos (Early prodyn.) 

14 

12-60 i 0-52 

10-70 ±0-52 

7-04 + 11-52 

(S 62-0) (w 4(H)) 





Abyssiniana with El Amrali and Hou (Late predyn.) 

(it 62-6) (n 10(H) 

El Amrah and Hou (Late predyn.) with Thebes (18-20 dyn.) 

14 

7-04 ill-32 

0-33+ 0-32 

... 

4*111 ±0-32 

14 

10-08+0-311 

8-80 + 0-31! 

0'lil ±0*3(1 

(n 105*6) (>T 64-i) 

Abydos (Early predyn.) with Abydos (12-15 dyn.) 

14 

4>(IH±(Wil 

4-34+0-51 

3* 13;±l)*61 

(n 40-9) (n 65b) 

Naqada A and Q with Abydos (12-15 dyn.) 

14 

2-38 ± (KID 

2-311 + 0-31) 

1*33+11*31) 

[n 64-9) (« 65-9) 

Thebes (18-20 dyn.) with Thebes (18-21 dyn.) 

23 

1-43 ±0-24 

1-57 ±0-24 

0-18! ±11-24 

(« 53 d) (ft 166-4) 




Naqada A and Q with Thebes (18--20 dyn,) 

27 

8-27 ±0-3(1 

7-08 ±0-3(1 

5-18,40-30 

(n 66-9) (n 53-4) 

Thebes (18-20 dyn.) with Egyptian E 

25 

3-37 + 0-18 

3*45 ± O'18 

2-35 + 0*18 

(ft 53-2) (S 866-0) 

Denderah (Roman) with Abydos (12-15 dyn.) 

14 

6-81 + 0-45 

5*88 ±0-45 

4*34 + 0-45 

(u 49*3) (ii 65-9) 

Thebes (18-20 dyn.) with Deshasheh (4-6 dyn.) 

14 

2-38 ±0-118 

2-25 + 0-51! 

1*42 ±0-50 

(fi 54-1) (H 39-9) 



is not fulfilled, some pairs of the characters being almost certainly quite highly 
correlated in all the series, but it is probable that the relative values of the 
estimates of resemblance are not more disturbed by tins than by stunt? of the 
other factors. In the case of a particular measurement used, comparmoris are 
only made between mean readings said to have been obtained by following tv 
particular definition of the measurement. Even so, the personal equations of 
different measurers may have been large enough, in some cases, to disturb the 
results quite appreciably. It is not possible to investigate this matter in the 
case of all the material used, but it must be recognized that the disturbance due 
to it is probably quite sufficient to invalidate any rigid application of statistical 
rules to the material, Mistakes in sexing the crania provide another possible 
source of error, though probably this is one of minor importance. 
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It is not at all likely, of course, that the extent to which a particular coeffi¬ 
cient is distorted is determined by the accumulation of the various kinds of 
disturbing factors mentioned, One may hope that they will usually counteract 
one another. In view of all the circumstances, it is obvious that uo great reliance 
can be placed on the numerical accuracy of any particular reduced coefficient 
of racial likeness, or on the comparison of any pair of the constants. It would 
be most inadvisable, for example, to compare the difference between any pair 
with regard to the probable error of this difference. It is not unreasonable, 
however, to accept the coefficients as indicating different, rather broad, grades 
of resemblance, and it is in this sense that they are used for purposes of classi¬ 
fication here, in conformity with previous biometric practice. 

The classification of the series treated in this section is based on reduced 
coefficients of racial likeness for pairs of the male series: the female series are in 
most cases shorter, but they may be expected to lead to very similar conclusions, 
Corresponding mule and female values for the Lachish compared with six other 
series are given in Tabic XI. For the first of these comparisons the difference 

TABLE XI 


Corresponding male and female reduced coefficients of racial likeness for 
comparisons of the Lachish with other series* 



Abydos 

18th dyn. 

Thebes 

18th—20th dyn. 

Abydos and Hou 
12th—15th dyn. 

cj (40-0) 9(60-1) 

(83*8) 9(43-6) 

<J (66-9) 9(87-4) 

Lachish 

(? 

(184-0) 

V 

(147-1) 

1-20+0-32 

(14) 

1-05 ±0-28 
(14) 

2-13+ 0-23 
(24) 

0-37+0-20 

(24) 

4-02 ±0-26 
(14) 

2-61 + 0-23 
(14) 




Egyptian E 
2(ith-30th dyn. 

Denderah 

Korina 



Oth-12th dyn. 

I2th~13th dyn. 



,S (860-6) V (600-7) 

,J (108-0) 9 (140-4) 

H (107-6) $(84-2) 


o' 

6-74 + 0-00 

0-57 ±0-14 

7*86 ±0-13 

Lachish 

(184-0) 

9 

(29) 

(14) 

(20) 

0-87 + 0-08 

8-67 ±0-17 

12-06 ±0-17 


(147-1) 

(20) 

(14) 

(29) 


* The series compared in this table witli the Lachish, are ones for which fuller particulars are 
given in Table VIII. The numbers in brackets after the sex signs are the »’s for the numbers of 
skulls on which the means used in calculating the coefficients are based. The Ft’s given for the 
Lachish series arc for the twenty-nine characters on which two of the comparisons are based, and 
its values for the other groups of characters are very close to these. 
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between the male and female coefficient in oi no account, but lor some, oi the 
others the difference is appreciable, even in the case of the Lachish compared 
with the Egyptian E series, for which all the means involved are based on an 
adequate number of skulls. The estimates of resemblance are in fact rather 
different for the male and female comparisons, though there is a fairly dose 
agreement in their orders, except in the case of the comparison with the lHth- 
20th dynasty series from Thebes. The means for this series suggest that the 
groups of male and female skulls composing it may not have represented pre¬ 
cisely the same population, and any slight discordance of this kind is likely to 
have an appreciable effect on the comparisons considered. Errors in sexing are 
also likely to affect them to some extent. We are again led to the. conclusion that 
there is no justification for attaching any importance to small differences bet ween 
coefficients of racial likeness. 

The conclusions regarding the racial affinities of the populations represented 
by the twenty-two Ancient Egyptian and allied series may now be considered. 
Reduced coefficients of racial likeness between all pairs of them are given in 
Table VIII, and the arrangement suggested by the lowest orders of these con¬ 
stants is shown in Eig. 3. Three sets of series, representing races in other parts 
of the world, have been treated in the same way: the results for a North American 
set have been published by von Bonin & Morant (1938), for an Asiat ic set by 
Woo & Morant (1932), and for a European set by Morant (11)28). For this last 
material crude coefficients of racial likeness only are given for forty-one series, 
four of which relate to North African peoples. I am indebted to Morant for the 
corresponding reduced coefficients, which ho has calculated hut which have not 
been published. The four North African series were excluded, so the numerical 
results to be considered refer to thirty-seven European series. 

It is of interest to compare the distributions of the reduced coefficients for 
the new Egyptian and the three other sets. These are given in Tabic XI I, There 
are marked differences between the ranges of the constants for the four sets of 
material. On the average the Egyptian shows much lower values than the other 
three sets, the highest reduced coefficient for it being 47-5, white for each of 
the others more than one quarter of the coefficients exceed 30 and several values 
greater than 100 are found. 

The relationships of the modern races of man are of such a kind that they 
can be roughly divided into groups, so that all the members of a particular 
group show close resemblances, while there are links between the different 
groups. This conception of constellations of races, os it were, which are joined 
to one another is undoubtedly useful, though it is often difficult to give precision 
to it, owing to the existence of subgroups within the major groups. The distri¬ 
butions in Table XII suggest that the selected set of twenty-two Ancient 
Egyptian and allied series may be considered to represent a single constellation 
of populations which had very similar types, while each of the other three sets 
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represents two or more constellations. The Egyptian series actually relate to 
peoples who lived from early predynastic to modern times, a span ot about 
7000 years, hut there is no suggestion that the group changed radically within this 
period either as the results of evolutionary change, or owing to infiltration by 
alien peoples. The area represented extends from Abyssinia to Palestine, though 
nearly all the material comes from Egypt. 

TABLE XII 


Distributions of reduced coefficients of medal Ukcnmfor 
various group of male, cranial series 



No. 



(Jocftioimtn of racial IikoiK‘88 

s 

j Total 

Groups of series 

of 






r - ___; (torn* 






2(H) 40(1 ; 400 430; 1MU " lwm “ 


series 

<5 

5-13 

13-50 

50-100 

100-200 

Ancient Egyptian 

22 

53 

103 

75 


„„„ 

1 5 

i - ! 231 

and allied 


(22*9%) 

(«•<)%) 

(32-5%) 



1 

European 

37 

33 

102 

362 

135 

34 

i 066 


(5*0%) 

(lfi'3%) 

(54-4%) 

(2041%) 

(51%) 

1 j 

North American 

16 

2 

9 

59 

45 

5 

, 120 

Indian 


(1-7%) 

(7-5%) 

(49-2%) 

(37‘5%) 

111) 

(4*2%) 

k 1 „ i 

Asiatic 

26 

7 

23 

102 

56 

211 i 1 j 325 



(2'2%) 

(71%) 

(31-4%) 

(33*8%) 

m%) 

(H'0%) ! (<Ki%) 

1 l 


It is known from experience that in the cross comparisons of two groups of 
cranial scries, representing two distinct families of ratios (A and B), some lower 
reduced coefficients will he found between them than some for pairs of series 
belonging to group A, or than some for pairs of series belonging to group B, In 
comparisons of the Ancient Egyptian with North American Indian series, for 
example, it would be anticipated that a certain proportion of the coefficients 
would be below 50, though the majority would doubtless exceed this limit. The 
peculiarity of the generalized measure of resemblance employed must be sup¬ 
posed due to the peculiar nature of the material, and it need not be taken to 
indicate any defect in the method. A classification based on such generalized 
estimates of resemblance must evidently proceed on certain lines, and tins way 
in which the constants can most usefully bo employed for the purpose has to be 
determined empirically. 

Some of the cross comparisons Ax £ referred to give lower reduced coeffi¬ 
cients than some for pairs of A’s, or some for pairs of B' s, but at the same time 
they have always been found to exceed a certain limit, so that the lowest order 
of coefficients are not represented by any AxB comparisons. This suggests 
that any classification of the material ought to be derived solely from coefficients 
less than the limit in question, i.e. from the evidence of close resemblance only, 
such as is never found, as far as experience goes, between any two populations 
belonging to distinct families of races. 
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The earlier studies of groups of cranial series treated by the method of the 
coefficient of racial likeness suggested that the limit in question, might be taken 
as a reduced coefficient of about 20, so that the classification would be based on 
all values less than 20, while all greater than this limit would be neglected 
entirely, Recent experience of the cross comparisons between North American 
Indian and Asiatic series (von Bonin & Morant, 1038) has shown that it is 
safer to employ a considerably lower limiting value, which was provisionally 
taken at 13. If the particular set of material treated is made up by a consider¬ 
able number of very closely related series, a clearer picture of the interrelation¬ 
ships is obtained, however, by taking a still lower limiting value. 

In the present case the limit was chosen arbitrarily at 5-0, that is to say the 
evidence presented by all reduced coefficients less than fvO is taken into account 
—bringing in 23 % of the total of 231 comparisons—while all values greater than 
C*0 are neglected. It should be realized that a reduced coefficient less than 5*0 
represents very close similarity of the cranial types compared, such as has 
hitherto only been found in eases where close relationship between the popu¬ 
lations represented was anticipated, from the cultural (historical or archaeo¬ 
logical) evidence. 

The grades of resemblance indicated by reduced coefficients of racial likeness 
less than 5*0 are shown in Fig. 3. The full lines represent “insignificant” coeffi¬ 
cients—i.e. values which differ from zero by less than 3-5 times their probable 
errors. In these cases the two series might have represented precisely the same 
population, as far as can be seen from the direct comparison, though, owing to 
the imperfection of the material, no stress can be laid on the difference between 
an “insignificant” coefficient and one of the same order which indicates clear 
differentiation judged by the ratio of the constant to its probable error. The 
second grade of resemblance is represented by coefficients which differ signi¬ 
ficantly from zero, but which are less than 3*5, and the third by values between 
3*5 and AO, 

It should ho stressed that any coefficients of racial likeness less than A0 
indicate a very close resemblance of the cranial types compared. There are two 
adequately long male series of seventeenth-century London skulls, from White¬ 
chapel and Farringdon St, and the coefficient betweon them is 3*8 (see Morant & 
Hmwlloy, 1931, p. 233), so a value of this order can only be taken to show the 
degree of divergence which may be found between parochial subgroups of the 
same population. 

The arrangement shown in Fig. 3 was arrived at by placing the series, as far 
os possible, in position relative to one another so that their distances apart are 
proportional to the estimates of resemblance considered. One of the series 
treated—viz. that of the 9th dynasty skulls from Sedment—is not shown, 
because the lowest reduced coefficient found with it is 5*1 (with the 4th~5th 
dynasty series from Deshasheh and Medum). 
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There is a clear suggestion of a division into two groups. 

Group A includes eight aeries from sites in Upper Egypt (covering a stretch 
of the Nile, about 100 miles in length, from Badari in the north to Thebes in 
the south—see Fig. 2), a series from El Kubanieh (100 miles south of Thebes), 
and a Nubian series from Kerma. The. period represented by these ton series 
extends from early predynastio times to the IHth dynasty. 

Group B includes seven series from the same area in Upper Egypt, two from 
Lower Egypt, one from Abyssinia, and the Lachish from Palestine. The Upper 
Egyptian series here range in time from the 1st dynasty—only represented by 
a single series of skulls from Royal Tomlm—to Roman times. The two Lower 
Egyptian series and the Lachish (of about 25th dynasty date.) fall within the 
same period, and the Abyssinian series is of modern date. 

The population of Upper Egypt is represented by fifteen of the twenty-two 
series, representing a time sequence from early predynastio to Roman t imes. All 
the series of earlier date than the 18th dynasty except one fall in group .1. and 
show very close interconnexions. At the same time, there, is a clear suggestion 
that the type was changing gradually with time, the earliest pmlyitiwfie 
(Badari) series being at one extreme, and the middle dynastic series at the other. 
The only aberrant Upper Egyptian series of a date prior to the. IHth dynasty 
is one from Royal Tombs at Abydos, and it may be supposed that thesis repre¬ 
sent an intrusive group coming from Upper Egypt. The, fact that tins early 
Upper Egyptian typo persisted almost unchanged until the IHfh dynasty is 
evidenced by the series of this date from Sheikh Ali. 

There are four Upper Egyptian series of the 18th-*21»t dynasties.two from 

Abydos and two from Thebes—which are quite distinct from the Upper Egyp¬ 
tian (A) group of series, and which must hence he supposed to represent mainly 
an intrusive population in the region. Immediately before the IHth dynasty 
the whole of Egypt was under the Hyksos dominion, and when this lapsed at 
the end of the 17th dynasty the country was in an unsettled state. It may be 
noted that if there were immigrants into Upper Egypt at this time, they were 
most likely to have been of the ruling classes, who would have settled and been 
buried in the principal towns in Upper Egypt, via. Abydos and Thebes, The 
population of Upper Egypt in later times is only represented by two series from 
Denderah, one of Ptolemaic and the other of Roman date. The types of these 
two are very similar to those of the presumed intrusive groups of the IHth AM 
dynasties, though they stand somewhat closer to the indigenous Upper Egyptian 
types. This may obviously he explained as being due to some slight degree of 
intermixture between the intrusive group and the settled population of Upper 
Egypt between the 18th dynasty and Ptolemaic times. 

If an explanation of this kind is on the right lines, then it becomes necessary 
to discover the source of the people who are presumed to have migrated to 
Upper Egypt about the time of the 18th dynasty. The relationships of the 
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cranial series in Fig, 3 provide a clear answer to this question, as they show 
intimate connexions between two late Upper Egyptian series and two of 
the three Lower Egyptian series available, viz. the 4th-5th dynasty series from 
Deshasheh and Medum, and the 2fith-30th dynasty from Gizeh, forming with 
them what is here called the B group. 

Unfortunately, the evidence available is quite inadequate to provide an 
outline of the racial history of Lower Egypt. A third series from that region, 
viz. one from Sedment of 9th dynasty skulls, is not shown in the diagram, 
because its lowest reduced coefficient exceeds 5-0; this is 5-1 with the 4th and 
5th dynasty series from Deshasheh and Medum, sites close to Sedment. The 
three Lower Egyptian series are very similar to one another, and they stand on 
the same side, as it were, of all the Upper Egyptian material. Another factor 
which has to be taken into account in interpreting the evidence is that the Upper 
Egyptian type was apparently becoming gradually modified, prior to the 18th 
dynasty, in the direction of the Lower Egyptian type. The evidence suggests 
the following general conclusions: 

(1) It may be supposed that originally the populations of Upper and Lower 
Egypt formed two distinct groups, the purest representatives of these known 
being the Early predynastic from Badari, and the 4th—5th dynasty series from 
Deshasheh and Medum. These are the earliest series available from Upper and 
Lower Egypt, respectively. For convenience, the groups may be called, following 
Morant, the Upper (corresponding to our Group A), and the Lower (Group B) 
Egyptian types. From the earliest times until about the end of the 17th dynasty, 
the type of the Upper Egyptian population became gradually modified in the 
direction of that of the Lower Egyptian. This may be supposed due to a gradual 
infiltration of Lower Egyptians into Upper Egypt. The relationships of the 
Kerma series from Nubia are of particular interest in this connexion. It is of 

12th.13th dynasty date, but its closest connexions are with two predynastic 

series from Upper Egypt. Hence it; may be supposed that the Kerma people 
represent the descendants of colonists who left Upper Egypt in predynastic 
times, and that this stock remained stable, and was not modified by inter¬ 
mixture, though the parent group itself was changing owing to contacts with 
the North. 

There is only one Upper Egyptian series of earlier date than the 18th dynasty 
which stands apart from the constellation formed by all the others. This is of 
1st dynasty skulls from Royal Tombs at Abydos, and as their type is very 
similar to the Lower Egyptian, it may be supposed that they represent an 
intrusive group from that region. This group, which was probably small, may have 
been absorbed in the Upper Egyptian population without affecting the type of 
the latter appreciably. 

(2) The situation in Upper Egypt appears to have changed radically in the 
18th dynasty. There is one Upper Egyptian series of this date (from Sheikh Ah) 
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which is very similar to the earlier series from that region, hut foui others 
(ranging from the 18th to the 21st dynasty) stand quite apart and show dose 
relationships with the Lower Egyptian series. The movement from Lower to 
Upper Egypt appears to have been greatly accelerated about the time of the 
18th dynasty. The evidence suggests that at this time it virtually became a 
peaceful invasion of Upper Egypt, which resulted in the almost complete dis¬ 
placement of the earlier population there. This view is supported by the fact 
that there are no series whatever later than the 18th dynasty of the early 
Upper Egyptian type, while two late series from Denderah are still of Lower 
Egyptian type, though somewhat closer to the early Upper Egyptian than are 
the 18th—21st dynasty series from Thebes and Abydos. These relationships 
suggest that the prevailing population in Upper Egypt after the 18th dynasty 
was of Lower Egyptian origin, and that it became mixed to some extent with 
descendants of the earlier population of the region. 

The fact that a modern population from the north of Abyssinia stands 
between the two Ancient Egyptian groups is interesting, but far more evidence 
would obviously be required to elucidate the significance of this relationship. 

The relationships of the Lachish series may now be considered. All its 
closest connexions are with scries of the Lower Egyptian type, and these are 
close enough to suggest that the population in the Palestinian town was entirely, 
or almost completely, of Egyptian origin. The skeletal material from Luchish is 
believed to represent people who died about 700 ius., which is the time of the 
25th dynasty in Egypt. The series shows closest resemblance, however, not to 
the 26th—30th dynasty series from Gizch, but to the early dynastic series from 
Lower Egypt (Deshasheh and Medum), and to three Upper Egyptian series of 
18th dynasty or later dates which are assumed to represent populations of 
Lo.wer Egyptian origin. There is also a rather less close connexion between the 
Lachish and one of the Upper Egyptian types (Abydos and Hou, 12th - 15th 
dynasty). These relationships suggest that the,Lachish population represents 
descendants of a colonizing group of men and women, which was derived 
primarily from Upper Egypt, at some time later than the 18th dynasty, and 
which maintained its typo unchanged—free from intermixture until 7(H) u.c, 
or later It is not at all unlikely, of course, that a colonial group in Palest,hut, 
such as the one which settled in Lachish, originally included people from several 
parts of Egypt, and not from a single restricted locality. The characteristics of 
the Lachish sample give no hint of heterogeneity, however, which may be due 
to the fact that any diversity which originally existed in the group was obscured 
by intermarriage within the community for several generations. The evidence 
suggests quite clearly that the Lachish people were derived principally from 
a population of Upper Egypt which was itself derived principally from emigrants 
who left Lower Egypt about the time of the 18th dynasty. 
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7. Comparisons op the Lachish and Ancient Egyptian 

SERIES FOR CHARACTERS CONSIDERED SINGLY 

Comparison of the Lachish and a number of Ancient Egyptian and allied 
cranial series, based on the averages for a number of characters considered in 
conjunction, have been made in the preceding section. The means for the same 
material will now be compared for characters considered singly, 

A convenient way of summarizing the statistical comparisons of means for 
groups having a number of features recorded, is provided by considering the 
a’s calculated in computing the coefficients of racial likeness. An a is an approxi¬ 
mation to the square of a quantity which is the difference of the means divided 
by the standard error of this difference. If an a is greater than 10, it may be 
supposed that the difference between the two means is clearly significant. The 
percentages of a’s greater than 10 may be used to distinguish characters which 
are practically constant for all the groups from those which frequently denote 
differentiation. 

The twenty-one comparisons between the Lachish and each of the other 
scries may bo considered first. The numbers of a’s available for this set range for 
the different characters from seven to twenty-one, only fourteen of the thirty- 
one coefficient of racial likeness characters being available for all the series.* 
These fourteen may be considered first, and they give the following grouping: 

(rt) characters showing a high proportion of significant differences: 

B (percentage of a’s > 10 = 47-0), 100 B/L (424)), and 100 BjW (42-9); 

(/;) characters showing a lesser proportion of significant differences: 

Nl, (28-0), H' (23'8), (23-8), J (19 0), and 100 11% (19-0); 

(c) characters showing few significant differences: 

L (14-3), 100 NBjNH (9-5), At (9-0), G'H (4-8), NB (4-8), and LB (0-0). 

It is clear that some characters can be supposed practically constant for all 
the series, while others show many significant differences. 

'I’he remaining characters which can be treated in the same way are only 
available for numbers varying from eight to twelve of the twenty-two series. 
'The numbers of comparisons are very restricted for those, but they suggest 
that .S’, O it G' v 100 O'HjOB, 100 G,JG' V and PL are practically constant for all 
the scries, while the Lachish is most frequently differentiated by B', Oc.L, fml, 
fruit, U, and rather less frequently by G t> () v 100 0 2 /() l and 100 fmb/fntl 

Table XIII gives the ranges of certain male means for the Upper Egyptian 
group of series (i.e. the tenon the left-hand side of Pig. 3), the means for the Lachish 

* The transverse arc {(!Q') for the Lachish series is not available for any of the others, and the 
estimated cranial capacities were not included, so the maximum number of characters used in 
computing coefficients with the Lachish series is 29. 
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alone, and the ranges for the remaining nine comprising the Lower Egyptian 
group (i,e, all on the right-hand side of Fig. 3 except the Lachish and Abys¬ 
sinian). There is seen to be complete separation of the ranges for the two sets 
of series, with the Laohish falling within the Lower Egyptian limits, in the case 
of B, B', U, and 100 B/1J', and a close approach to the same condition in the ease 
of J and 100 BjL. All these measurements are transverse breadths, or indices 
including a breadth, and there is no doubt that the values of the reduced 
coefficients of racial likeness are largely determined by differences between 
them, Owing to the use of correlated measurements, the differences in cranial 
breadth affect the generalized measure of resemblance unduly. This factor was 
apparently showing far more significant variation in Ancient Egyptian popula¬ 
tions than any other relating to the cranium. 


TABLE XIII 

Ranges of mean measurements for two group of Ancient Egt/pHnn 
male skulls and the LacMsh means 



Series 

Period 

U 

m HjL 

loo MV 

Upper Egyptian type 
Laohish 

Lower Egyptian type 

Early predyn.-lfith dyn. 
cu, 25th dyn. 

1st dyn.-Roman 

_ 

500-0-510-4 (4) 
filH-1 

510-8-518-7 (5) 

71-7 73-7 (10) 
7-1-3 

73-7 71MI (11) 

. ___ 

HH'l ll/l-l (Hi) 
102-4 

102-3 illfl-4 (!)) 



Upper Egyptian typo Early predyn.-ISth dyn. 182-2 185-2 (10) 
Lachiali m, 25th dyn. 184-5 

Lower Egyptian type 1st dyn,-Roman 181-4-1H6-H (0) 



Laohiah ca. 25th dyn. 37-0 3 (].f, 

Lower Egyptian type 1st dyn.-Roman 36-1-37-0 (5) 29-7-30-2 (5) 

* The numbers in brackets indicate the number of series to which the ranees re 


00-2 112-3 

W 

69-5 


511-901-5 

(3) 
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The remaining characters treated in the table make no clear distinctions 
between the two sets of series, and the same situation is observed in the case of 
all the other coefficient of racial likeness characters. The Upper Egyptian series 
show a slight tendency to be more prognathous than the Lower Egyptian 
(judging by N/L), but the two ranges overlap appreciably. It may be noted that 
the foraminal length and breadth of the Lachish type are extremely large, and 
its occipital index is extremely low compared with those for the other series. 

The majority of the measurements not used in computing the coefficients 
appear to be fairly constant for all the Ancient Egyptian series, and the Lachish 
means fall within their ranges. This is. so far the bimaxillary breadth (OB), in 
spite of the clear differences found between the bizygomatic and calvarial breadths. 
The index 100 (B-H')jL does make a clear distinction between the Upper and 
Lower Egyptian sets of series, but this again appears to be due to the fact that 
a breadth measurement is involved. 

The simotie measurements, giving estimates of the “flattening” of the nasal 
bridge, are only available for a few series. Comparison with, the data for these 
given by Woo & Morant (1934), shows that the breadth of the nasal bones 
(*S'<7) is unexceptional for the Lachish type, hut the subtense (SS) and index 
(loo AS I SO) are decidedly larger for it than for the Badari, Kerma, Sedment, 
and an Ancient Nubian type. The greater curvature of the nasal bones in the 
Lachish skulls places them within the range found for European populations. 

Comparative material for the malar bone measurements is still more re¬ 
stricted. Comparison with the data given by Woo (1937) suggests that the 
Lachish means are unexceptional, but for the index measuring the curvature of 
a horizontal section of the bone (100 SfG, ml). For this the Palestine series lias 
a mean which is decidedly greater than those for the two Egyptian series, and 
this mean places it close to the extreme for the types from all parts of the world 
hitherto described. For the vast majority of measurements the Lachish skulls 
are not distinguishable as a group from series representing Ancient Egyptian 
populations. The most distinguishing features of the type appear to be the 
exceptional curvature of the nasal bridge and horizontal sections of tire malar 
bones. This divergence is suggestive, but more abundant comparative material 
would be needed to assess its significance. 

8, The contours or the Lachish series 

Contours of adult skulls in the Lachish series were drawn in accordance with 
the methods used in a number of earlier craniometric studies published in 
Biomelrika , and a selection of the total group had to be made for this purpose. 
Types were constructed from the measurements of these contours in the usual 
way (Figs. 4-9), and they relate to material from the four tombs combined 
(see p. 123). The selection was made by first excluding all specimens too in¬ 
complete to give the Frankfurt orientation, and then excluding from the 

Bfenneltika xxxi io 
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remainder those for which the horizontal and transverse sections arc defective 
to more than a slight extent. The totals included are 10ft male and HO female 
crania, and almost all the measurements used in constructing the ty pes (Tables 
XV-XVII) are available for every one of these specimens, except in the case 
of facial measurements of the sagittal contours for which the numbers vary 
appreciably. The type contours of the Lachish series are based on samples which 
are almost as large as any previously used for the purpose. 

A comparison is made in Table XIV between certain measurements of the 
sagittal type contours and corresponding caliper measurements for the total 

TABLE XI.V 

A comparison of mean caliper and type, contour measurements 


Character 

Male 

Female 

Contour 

CalijKir 

Contour 

Caliper 

L 

1H5-3 (108) 

184-5 (322) 

17K-K (89) 

170-8 (2511) 

IP 

135 0 (0(i) 

133-8 (2(18) 

127-0 (7-1) 

128-4 (213) 

*V 

113-0 (108) 

112-1) (299) 

110-2(811) 

108-7 (248) 

SJ 

11<M (108) 

11(1-0 (323) 

111-8 (80) 

112-1 (251) 

o 3 

Ofi'7 (07) 

9(1-3 (2H0) 

94-5 (79) 

94-0 (210) 

fml 

30-7 (9(i) 

37-0 (247) 

35-1 (74) 

35-8 ( H)3| 

116-8 (H7) 

O'li 

69-7 ((ill) 

70-1 (08) 

(111-2 (44) 

Qh 

94-2 ((12) 

1)4-3 (89) 

91-1 (38) 

9O-0 (70) 

LB 

101-8 (1)6) 

100-7 (243) 

97-1 (74) 

90-4 (2(H)) 

PL 

H7"-() (6(1) 

8(!'MI (81) 

811-5 (44) 

84 -9 (02) 

N/_ 

63"-4 (62) 

04'W) (89) 

65"-1 (38) 

04 -5 (75) 

al 

75°-2 (62) 

73"-9 (89) 

73SI (38) 

73 -7 (75) 

bl 

41°-4 (62) 

42°-0 (89) 

41°-3 (38) 

41"'-8 (75) 


series. The former are either used in the construction of the type (6*7/, 67,, US), 
or calculated from measurements used in its construction (//', .S'l, <S' a ', fml. 

Pi, Nl, Al and PL), or measured on the figure (/,). In comparing the means 
of the two kinds, it must not bo forgotten that one scries represented is a 
selected group of the other. It might be anticipated that the. process of selection 
described would favour the larger and stronger skulls, more likely to be well 
preserved, and hence rather larger means would he expected fur the contour 
series. The divergences between corresponding values are actually found to las 
very small. Of the nine absolute measurements the male contour mean exceeds 
the caliper in six cases, the largest difference being 1*2 mm., and the position is 
reversed for the other three, which show a maximum difference of 0-4 mm. For 
the same measurements the female contour mean exceeds the caliper value in 
five instances, the maximum difference being 2-0 mm,, and the maximum 
difference for chords showing the caliper greater than the contour mean is 






D. L. Risdon 


147 

O’8 mm. The angles show a close correspondence. On the whole there is a good 
agreement between the measurements obtained in the two ways, suggesting 
that the contours were drawn with sufficient accuracy, and that the sub-series 
gives a fair representation of the type for the total series. 


L A R 



log. 4. Transverse typo contour baaed on 10H undo Lachish skulls. 


The type contours (Figs. 4-9) have no striking peculiarities, and super¬ 
ficially, at any rate, they appear to be very similar to those given for Ancient 
Egyptian and even some European series. They are average in size, show an 
orthognathous facial skeleton, and only moderate muscular development. The 
male contours are larger than the female in all respects to the same extent as 
is usually found. 


id-2 
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Fig. 10 shows the three male Lachish types and those given by Miss Collett 
(1933) for the Kerma, and by Miss Stoessiger (1927) for the Badari series, super¬ 
posed. There is clearly a close agreement between the average contours for these 
two Ancient Egyptian and the Palestinian series, though the differences arc 
probably almost as great as those which would bo found between any pairs of 
the types of Ancient Egyptian series. The Kerma and Badari series are assigned 

L ‘ A R 



by their mean measurements to the Upper Egyptian group, and the Ltwhwlt 
belongs to the Lower Egyptian (see Fig. 3). The superposed types allow the 
greatest differences in calvarial breadth (transverse and horizontal sections), 
and decidedly smaller ones in calvarial lengths (horizontal and sagittal), as 
would have been anticipated, The Laehish facial skeleton is seen to be rather 
less prognathous than the Kerma or Badari, but the section of its nasal hones 
is the most projecting (cf. p. 145). The type for the predynastie series is the 
smallest in nearly all respects and particularly in facial height, 
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TABLE XV 

Mean measurements of transverse contours of the Lachish skulls 
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TABLE XVI 


Mean measurement of horizontal contours of the Lachish skills 
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TABLE XVII 


Mean measurements of sagittal contours of the Lachish skulls 




Ordinates above Ny 

Sex 

Ny 

0=N 

Ni 

1 

2 

3 

4 

5 

t 

182-3 (108) 



69-4 (107) 

72-5 (108) 

80-1 (108) 

84-2 (108) 

85-7 (108) 


177-0 (89) 

19-7 (89) 

■ 

37-6 (89) 

57-8 (89) 

89-8 (89) 

76-7 (89) 

80-5 (89) 

82-1 (89) 


Sex 

Ordinates above Ny 

0 

7 

8 

9 

ri 

ri 

r 

3 

86-9 (107) 

82-8 (107) 

73-7 (108) 

66-0 (108) 

20-0 (108) 

19-3 (108) 

0-24 (108) 

? 

82-2 (89) 

78-6 (88) 

69-9 (88) 

60-6 (89) 

24-4 (89) 

17-1 (89) 

5-0 (89) 


Sex 

Ordinates below Ny 

Vertex 

0=N 

Ni 

1 

2 

8 

9 

x from N 

y 

3 

65-9 (87) 

62-6 (95) 


63-7 (97) 

48-5 (100) 

38-7 (108) 

101*3 (108) 

80-4 (108) 

¥ 

62-1 (66) 

68-5 (79) 

52-2 (89) 

61-2 (82) 

47-3 (89) 

37-7 (89) 

04*5 (89) 

82-9 (89) 
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TABLE XVII (cont.) 

Mean measurements of sagittal contours of the Lachish shills (cont.) 


Aur. Pt. 


Opiathion 


x from N y x from y y * from y y From y From N 


100-3 (96) 101-8 (96) 
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9. Comparisons op the Lachish series with series op crania 

AND LIVING PEOPLE OTHER THAN EGYPTIAN 

The comparisons made above have shown that the Lachish series is very 
similar in type to several Ancient Egyptian series. In fact it bears as close a 
resemblance to some of these as they do in general to one another. Hence it 
may be concluded that the population of Lachish in the year 700 n.c. was 
primarily, at least, of Egyptian origin. The cranial evidence from other countries 
of the Near East is very meagre. Comparisons are made with the only other 
Palestinian series of any length, and with a few series of living people, in this 
section. 

A considerable number of human skeletons was excavated by Prof. R. A. 8. 
Macalister at Gezer, 17 miles south-east of Jaffa, from 1902 to 1905 and 1907 to 
1909. These are dealt with in a chapter in his report (1912). Material of the pre- 
Semitic period was very fragmentary, and no measurements of it could be taken. 
Five indices for a series of skulls of the Semitic periods are given in the form of 
frequency distributions only. Division is made between a series for the first 
and second periods together, and a series for the third and fourth periods 
together, in the easo of the cephalic index. A distribution of reconstructed 
statures is also provided. The material was apparently unsexed, and it is not 
clear how the absolute frequencies could be determined from the diagrams. 
All that can be said is that the average cephalic index for both series of unsexed 
skulls is about 76. The Lachish male mean is 74-3, and the female 75 , fl. 

There appear to be no published records for any other long series of skulls 
of any date from Palestine, or any neighbouring country except Egypt. Measure¬ 
ments of small numbers of ancient Jewish and Phoenician and modern Arab 
specimens have been given, but these are practically worthless for statistical 
purposes, There are no adequate data for Ancient Jewish skulls from any 
locality. The longest modem series representing this people is one published by 
Prof. J. Matiegka (1926) of seventeenth-century Jews buried in Prague. The 
average cephalic index for the fifty-three male skulls is 82*0, which is sufficient 
to show that there can be no close connexion with the Lachish people (74*3), 
or with any of the Ancient Egyptian groups, for which the highest index is 7 (Mb 
The coefficient of racial likeness was computed for twenty-two characters between 
the male Jewish series from Prague (n = 32-8) and the Lachish series {nm 202-H); 
a reduced value of 52-6 is found. This is greater than the maximum found 
between any pair of the series of Egyptian typo, including the Lachish (see 
Table XII). It is not suggested, of course, that the Palestinian Jews in 700 B.c. 
were necessarily of the same type as Jews in Prague in the seventeenth century. 

Measurements of a certain number of Jewish people in Palestine have been 
published, and only the cephalic index for series of men will be considered here, 
No accurate comparisons with cranial data can be given m the case of any 
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other characters recorded. Weissenberg (1909) gives a mean value of 79-8 for 
fourteen men in Galilee. All the remaining series of Jews in Palestine represent 
the Samaritans, the sources and mean indices being: 

Weissenberg (1009)—76-2 (20); Rappers (1034)—77-2 (27); 

Szpidbauin (1927)—77-6 (27); Huxley (1006)—78-1 (35); 

Gemia (1938)—79-1 (39). 

In view of the small sizes of the series, these averages accord remarkably well. 
The pooled mean for the 148 men is 77-9. 

Weissenberg (1909) has also given cephalic indices of 76-fi for twenty-five 
fellahin measured near Jaffa, and 75-7 for thirty in the Safed district, or a pooled 
mean of 76-2 for the fifty-five men. This value differs markedly from that of 
81-0 given by Rappers (1934) for 139 Arabs from “towns north of the line 
Jaffa-Jericho”. There appear to be some well-marked regional differences 
between Arabs in different localities in Palestine. 

I he mean cephalic index for the male Laoliish series of skulls is 74'3, which 
can ho supposed to correspond to a value of about 7(1*3 in the living. The latter 
is decidedly less than the mean for one series of Arabs, rather less than the 
Jewish means, and practically identical with that for the Arabs measured by 
Weissenberg. It is quite unsafe to lay stress on comparisons of a single character, 
but those made show that a population with the same cephalic index as that of 
the ancient inhabitants of Lachish is living in Palestine to-day. The possibility 
that the pre-Christian type has persisted until modern times is not precluded, 
though tar more evidence—and particularly that of later series of skulls from 
the country—-will be needed to disclose its racial history in any detail. 

10. Mandibles and long bones of the Lachish series 

In all there are seventy-six mandibles in the Lachish series—fifty-six from 
fomh 120 and the remainder from the three other tombs—most of them being 
defective to some extent. Of the total, one is associated with an adult male 
cranium, and nine with adult female crania. The remaining sixty-six are un- 
iWHoeiatcd, anti they were sexetl by anatomical appreciation. Remarks on a few 
of the specimens noted as being anomalous are in § 4 above. Measurements 
were taken of the adult bones in accordance with the biometric technique 
(Morant, (iollett & Adyanthaya, 1930). There are thirty-four male and thirty-five 
female specimens. Means for these are given in Table XVIII, and comparisons 
with other material would not be profitable, as it has been shown (Cleaver, 1937) 
that considerably larger numbers would be required to reveal small racial 
differences in type, 'Neither the measurements nor the appearance of the 
Lachish mandibles suggest any clear divergence from Ancient Egyptian types, 
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TABLE XVIII 


Mean measurements of Lachish mandibles 


Character 

Male 

Female 

Character 

Male 

Female 

w i 

118*6 (7) 

114*9 (7) 

cji 

04*3(11) 

00*0(10) 

cJ 

20*5 (17) 

19*4(18) 

mfi 

20*9 (14) 

24*5 (14) 

rV 

32-6 (2(5) 

29*9 (28) 

ml 

125 r M) (14) 

124"*5 (15) 

rasp. 

27*4 (26) 

27*2 (23) 

RL 

7r*0 (5) 

72°*1 (9) 

K 

34*8 (20) 

31*8(19) 

CL 

74 c (> (11) 

72' 3 -7 (13) 

» 

44*8 (31) 

43*2 (33) 




eJ 

73*1 (14) 

70*1 (15) 

100 c r hjml 

60-7 (5) 

00*5 (0) 

rl 

59*75 (14) 

54*9(15) 

100 

8(1*0 (4) 

90*5 (5) 


97*1 (11) 

85*5(11) 

100 MoM 

133*3(11) 

122*9(11) 

G/fCf 

91*1 (4) 

92*1 (8) 

100 rb'jrl 

54*7 (14) 

63*1) (14) 

ml 

104*8 (8) 

100*3 (8) 

100 M 0 /V r 

1014 (4) 

92*1 (6) 


Other parts of the skeleton are only represented in the Lachish series (all 
tombs) by two sacra and nearly 200 long bones, many of which aro incomplete. 
These are not associated together and no attempt was made to sex them, The 
maximum lengths of the adult long bones were determined, and means for them 
are given in Table XIX. As far as can be seen from these constants, the Lachish 
people were rather short, but no approximation of any value to the average 
statures of the men and women can be given. One ulna (No. 2) has a healed 
fracture of the lower shaft, and one femur (No. 3b) has condyles affected by 
arthritis, 

TABLE XIX 

Means of the maximum lengths of unsexed adult long 
bones of the Lachish series 


1 

Femora 

(obliquo) 

Tibiae 

(obliquo) 

Humeri 

(oblique) 


Ulnae 

Clavicles 

R 



298*5 (11) 

243*7 (0) 

204*0 (5) 

147*25 (4) 

L 

430*45 (20) 

372*25 (8) 

300*1 (16) 

239*8 (5) 

2(10*0 (0) 

154*0 (l) 


11. Summary and conclusions 

The skeletal remains reported on in this paper for the Trustees of the late 
Sir Henry Wellcome were collected at Tell Duweir (Lachish), twenty-five miles 
south-west of Jerusalem, by the Wellcome-Marston Expedition to the Near 
East from 1933 to 1936. The bones were found in four adjoining tomb chambers 
and they are assigned to the seventh and eighth centuries B.c. In all there are 
695 crania, the majority of which are more or less imperfect, and much smaller 
numbers'of mandibles and other bones of the skeleton (see table on p. 103). 
Of the crania 360 were judged to be adult male and 274 adult female, the 
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remaining sixty-one being immature. The origin of the collection is discussed, 
and it is concluded that the remains are probably those of people who died as 
the result of some catastrophe. The frequencies of occurrence of different states 
of closure of the principal calvarial sutures show, from comparisons with other 
cranial series, that the adults from Lachish were younger, on the average, than 
cemetery populations are expected to be. Very few aged individuals were 
interred in the tombs, The normal order of closing of the sutures was sagittal- 
coronal—lambdoid, Remarks on unusual conditions and anomalies are given, 
the most interesting specimens being three trepanned skulls; two showing marked 
artificial deformation and six suspected to have been deformed artificially; a 
series of seventeen showing premature closing of the sagittal suture without 
dear deformation except in three cases; one believed to be distorted owing to 
premature closing of the coronal suture; one with absence of the right auricular 
passage; and one with an extensive diseased area on the vault. 

Statistics regarding the loss of teeth before death show that they were 
remarkably well preserved. Remarks on dental anomalies are given. The most 
interesting skull from this point of view is one which was found to have a metal 
filling in one of its molars, presumably acquired by accident. 

Judging from comparisons of the measurements, there is no reason to doubt 
that the scries from the four tombs represent precisely the same population, and 
the differences found between the male and female adult and juvenile constants 
are no greater than those expected in such a case, The variabilities and sox 
ratios of the total series (combining skulls from all tombs) are quite unexceptional. 

Comparisons are made between the Lachish and twenty-one Ancient Egyp¬ 
tian and allied series of skulls by the method of the coefficient of racial likeness, 
and a classification of the material is presented. The relationships found suggest 
that the population of the town in 700 b.c. was entirely, or almost entirely, 
of Egyptian origin, very close connexions with some, contemporary Egyptian 
series being found. They show, further, that the population of Lachish was 
probably derived principally from Upper Egypt. Comparisons of measurements 
considered singly indicate that the Lachish cranial type lias no features which 
would bo unusual for an Ancient Egyptian type, other than the prominence of 
its nasal bones and the curvature of its malar bones. Transverse, horizontal, 
and sagittal type contours based on IDS male and eighty-nine female Lachish 
skulls are given, and it is shown that they are very similar to some previously 
provided for Ancient Egyptian series. There are no good records for any series of 
skulls from Palestine other than the Lachish. Its mean cephalic index accords 
fairly well with that given for a series of living Palestine Arabs, and it is close 
to that for Samaritans. The possibility that the Lachish people have persisted 
until to-day is not precluded, but far more evidence would be required to sub¬ 
stantiate such a hypothesis. The series of mandible and long bones from Lachish 
are too small to be of any value for comparative purposes, As far as can be seen 
the people were rather short. 
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APPENDICES 

I. Definitions of skull measurements taken 

Measurements of tho Lachish material were taken in accordance with biomofcrio prac¬ 
tice, The definitions of cranial points given by Buxton & Morant (1933), and those of man¬ 
dibular measuromonts given by Morant, Collett & Adyanthaya (1930), wore, followed. Tho 
contractions bolow are used to denote measurements in the tables and text, and their 
numbers in Martin’s list are given. 

C = capacity in c.c. It was not possible to determine tho capacities of any of tho Lachish 
skulls directly, owing to the fact that tho interiors of the brain-boxes are coated with mud 
and wax which cannot bo removed. Tho reconstruction formulae using L, It, and IP given 
by Pearson & Stoessiger (1927) were applied to give tho estimates in Table VI, and these 
were not used in computing coefficients of racial likeness. L — maximum glabella-occipital 
longth (M. 1). B = maximum horizontal breadth (M. 8). H' = bosio-bregmatic height 
(M. 17). LB = basion to nasion (M. 5). B' - minimum frontal breadth (M. 9). >V — are 
nasion to opisthion (M. 25). S 1 = arc nasion to bregma (M. 20). iS' a = arc bregma to lambda 
(M. 27). S s = arc lambda to opisthion (M. 28). S[ — chord nasion to bregma (M. 29). 
S' t = chord bregma to lambda (M, 30). S'f = chord lambda to opisthion (M. 31). U ~ hori¬ 
zontal circumference measured through the ophyron and direotly above tho superciliary 
ridges (M. 23 a). jtQ' = transverse circumference from one auricular point to the other, 
passing through brogma (M. 24), fml = basion to opisthion (M. 7). ftnb = maximum breadth 
of foramen magnum (M. 10). Q'H = nasion to alvoolar point (M. 48). (IL — Imsicm to 
alveolar point. OB = facial breadth between lowest points on zygomatic-maxillary sutures 
(M. 46). J = maximum breadth between zygomatic arches (M. 46). NH, L •-= nasion to 
lowest edge of pyriform aperture on the left side. NB - maximum breadth of pyriform 
aperture (M. 64). 0 X L — maximum breadth of loft orbit (M. 61). O s L ~ maximum height 
of left orbit (M. 62). 0[ = longth of palate from orale to staphylion (M. 02). (\ - breadth 
of palate between inner alveolar walls of second molars (M. 03). OH « maximum projec¬ 
tion from biporial axis in tho transverse vertical plane, measured on transverse contour. 
/S'C-simotic chord, minimum breadth of nasal bones (M. 67). AV-subtense of simotio 
chord. Measurements of the left malar bones, taken in accordance with Woo’s instructions 
(1937), are; Ml x = minimum horizontal arc. Ml t — minimum vertical arc. C(ml) as chord 
between terminals of horizontal arc. S(ml) - maximum subtense from the chord. The 
ocoipital index is the only one which needs definition: it is 



Values for tho individual skulls wore found with the aid of Miss Tildcsloy’s table of 
the funotion (Biometrika , 13 (1921), 261-2). PL - profile angle between Frankfurt 
horizontal plane and the chord joining nasion to alveolar point. NL, Al, and HL are 
tho angles of tho triangle of which the nasion, alveolar point and basion are the apices. 
w l =maximum breadth outside condyles, o v l = maximum length of tho loft condyle. 
rb' =minimum antoro-postorior “breadth” of the left ramus, = chord between the 
points on the outer left alveolar margin from the middle of tho second molar to the middle of 
the first premolar. h x = symphyseal height from intradental to the point farthest removed 
from it in tho symphyseal plane, zz ~ minimum ohord between tho (Ulterior margins of 
the right and left foramina menlalia. c r e r = coronial breadth from right coronion to 
left coronion. ML- mandibular angle. c v l = projective length of tho corpus. ri!= pro¬ 
jective length of the left ramus. g 0 g„ = chord from left gonion to right gonion, ml — maxi¬ 
mum projective length of the mandible, efi = projective height of the left coronoid process. 
mji = projective height of the corpus at the middle point of the outer alveolar margin 
of the second left molar. RL = angle of condylar-coronoidal line with ramus tangent. 
0 L — angle between the standard horizontal plane and tho line joining tho infradontal to 
the most anterior point in the standard sagittal plane of tho symphysis. 
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LIST OF PLATES 

Plato I. Views of the interior of Tomb 120. 

A. Entrance of tomb showing skulls round side. 

B. Interior of tomb with skulls collected round side. 

Plato II. Tomb 120, collections of skulls in situ. 

A. Skulls collected round wall. 

B. A closer view of skulls seen to the left of the beam in Plate Ia. 

Plato III. Exceptional crania. 

A. A female skull (No. 485) showing extensive burnt patch. 

B. A male skull (No. 108) showing injuries probably inflicted shortly before death: see p. 104. 

0, D. Male skulls (No. 156, Tomb, 107 and No. 179, Tomb 116) of exceptional type; no other 

spooimons of this typo wore found. 

Plato IV. A male cranium (No. 340) with hole in right parietal, probably a trepan, and a sword-out 
on the same bone dose to tho coronal suture. 

Plato V. Male crania with trepanned openings. 

A. No. Ilfi. There is a septic area round the trepan in the right parietal. A cut probably made by 
tho trepanner can bo aeon by tho lateral outline of the left parietal. 

B. No. 114. 

Plate VI. Crania showing marked artificial deformation. 

A. A male oranium (No. 381). 

B. A female cranium (No. 673), 

Plato VII. Norma lateralis views of five male crania (A-E) believed to bo artificially deformed to 
a slight extent, and of a male cranium (E), probably deformed owing to premature closing of 
tho coronal suture. A, No, 378; B, No, 375; C, No. 377; D, No. 376; E, No. 379; E, No. 380, 

Plate VIII. Noma vertioalis viows of threo anomalous crania. 

A. A male cranium (No. 380) with coronal suture obliterated. A norma lateralis view of this 
deformed specimen is shown in Plate VIIB. 

B. A female cranium (No. 464) with healed injury on loft side of the frontal bone. 

C. A motopic female cranium (No. 670) with wound on left parietal bone and the Bagittal suture 
prematurely obliterated. 

Plato IX. Norma lateralis views of a typical female (No. 388) and a typical male (No, 28) oranium 
of the Laohish series. Other views of these two specimens are shown in Plates X-XII, They 
were selected from among the more complete skulls on account of tho fact that their measure¬ 
ments of shape show no marked divergences from the means for tho series. In tho ease of 
No. 388, none of these measurements differ from the fcmalo mean by more than the standard 
deviation of the distribution. Tho same is true for No. 28 in comparison with the male moans. 
All the photographs of the typical skulls are approximately 0-6 natural siro (linear dimensions). 

Plate X. Norma facialis views of a typical female (No. 388) and a typical male (No. 28) oranium 
of the Laohish scries. 

Plate XI, Noraiu rcrticnlis views of a typical female (No. 388) and a typical male (No. 28) cranium 
of tho Laohish series. 

Plate XII. Norma occipitalis views of a typical female (No. 388) and a typical male (No. 28) 
cranium of tho Laohish series. 


II-Z 
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Plate XIII. Crania showing premature obliteration of the sagittal suture. 

A. A female cranium (No. 072) showing complete obliteration of the. suture with distortion. 

B. A male cranium (No. 3(W) showing eoinjilete. obliteration of the suture with distortion, 

G. A female cranium (No. fi(17) showing complete, obliteration of the suture, w ith distortion. 

I). A male cranium (No. 36ft) showing complete obliteration of the suture without apparent 
distortion. 

E. A female cranium (No. 577) showing almost complete obliteration of the suture and {mst- 
coronal constriction. 

Plate XIV. Three male crania with sutural anomalies. 

A. No. 290. Large wormian bone in right side of eorounl suture. 

B. No. fill. Two symmetrical interparietal bones of unusual form. 

C. No. 33(1. Left side, allowing temporal lame largely fused to puriel-al. The right aide of this 
specimen is affected in a similar way. 

Plato XV. Anomalous regions of two male crania. 

A. No. 324. Complete almenee of right, auricular passage; twice natural size. The left auricular 
passage is normal. 

IS. No. 301. Complete alisenee of left jugular foramen: K1 times natural size, '(’he right foramen 
is normal. 

Plate. XVI. A male cranium (No, 3M2) of unusual form, possibly affected by hydrocephaly. A, 
Norma klcrali *j B, Noma famli*; C, .Vo nm vrriimlin, 

Plato XVII. A female cranium (No. (1112) with largo diseased area on frontal lame (nee p. 115), 
A, Norm lakralut; B, Norma fminlU-, C, Norma mtmli*. 

Plato XVIII. Exceptional crania, 

A, A male cranium (No. ft) with largo healed wound on frontal bone. 

B. A male cranium (No. I) with diseased area on right parietal. 

0. A motopio female cranium (No, 419) witli injury to the right frontal lame. 

I). A female cranium (No. 513) with depression in left, parietal, 

Plate XIX. A fomalo cranium (No. 51H) showing a tooth (upper right second molar) with a filling 
presumed to be adventitious. 

A. The palate (1*2 diameters). 

B. The second molar with the filling in nitu ((H) diameters). 

C. A skiagram (2-7 diameters) allowing the, depth of the filling. 

I). The three molars (2*7 diameters) after removal of the filling in the second molar. 

The skiagram in this plate and others in Plates XX II and XXIII were kindly provided by 
Mr 0. Bawdier Henry, M.Il.C.B. 

Plate XX. A female skull (No. 437) with anomalous jaws. 

A. The mandible from above, showing denticles outside the dental arch. 

B. The palate showing a diastema between the central incisors, and diastemao between the 
lateral meteors and canines. 

0. Tho right side of the mandible showing denticles. 

Plate XXL Palates with anomalous dentitions. 

A, A female cranium (No. 4*45) with sockets for three incisors only/ 

B, A male cranium (No. 132) with diastemao between canines and promolars, and third molars 
absent. 

0. A juvenile cranium (No. 705) with supernumerary tooth behind the right central incisor, 

D. A female cranium (No. 401) with grossly deflected left canine. 
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Plate XXII. Photographs and skiagrams of jaws with anomalous dentitions. 

A. A female cranium (No. 506) showing incomplete eruption of the second and third right molars. 

B. Skiagram of the same specimen (A) showing that the roots of the partially erupted teeth 
■were fully formed. 

C. A female cranium (No. 383) showing a denticle in the palate. 

D. Skiagram of the same (0) showing the limits of the denticle and its crypt. 

E. A female cranium (No. 496) with impaction of the upper left third molar. 

P. Skiagram of the right side of a mandible showing two dentioles in the corpus, female (No. 
1056). 

Plate XXIII. Jaws with anomalous dentitions or cy3ts. 

A. A skiagram of a juvenile jaw (No. 1068), right side, showing canine and first premolar 
unerupted. 

B. A skiagram of the same mandible (A), left side, showing the same teeth on this side unerupted 
(see p, 120). 

C. A male skull (No. 72) showing the third molar on the left aide in abnormal position. 

D. Occlusal view of the same jaw (C) showing the abnormal position of the third molar, rotation 
of the second premolar on the left side, a retained milk canine, and absence of the third molar 
on the right side (see description on p. 119). 

E. A female cranium (No. 469) showing a large cyst in the right molar region. 

E. A female cranium (No. 467) showing a large cyst in the anterior part of the palate pene¬ 
trating to the nasal aperture. 
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Plate III 



A. A female skull (No. 485) showing 
extensive burnt patch. 


B. A male skull (No. IDS) allowing injuries 
which probably mu ml death. 
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C. A male skull (No. 15(1) of exceptional D. A mule skull (No. 1711) of exceptional 

type, type and similar to C. 
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Plate V 



II. No. 114. 

Male crania with trepanned openings 
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Plate VI 



B. No. 073. 

Marked artificial deformation of a male (A) and a female (B) cranium 
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Plate VII 




10. No. 371). K, No. 3H(I. 

Ive male crania (A~E) believed to bo. artificially deformed to a slight extent and a male 
cranium (P) probably deformed owing to premature closing of the coronal suture, 
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A typical female (No. 388, above) and a typical male cranium (No. 28) 
of the Lachish series, 


Plate IX 
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Risdon: Shulls from Tull Duwr.ir (ImcIuxIi) 


Plate XI 



A. No. I1H8, female. 



15. No. 28, mule, 

A typical female and a typical male cranium of the Lachlsh series. 
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A typical female (No. 388, above) and a typical male cranium (No. 28) 
of the Lachish series. 


Plate XII 
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Plate XIV 



A. No. 291), largo wormian bone in right nidi' 
of coronal Hiit uir. 





li, No, (id. two symmetrical tuiiTpiirictiil 
hones of umisiutl lorin. 




a No. 330, temporal bone largely fused to parietal. 


Male crania with sutural anomalies, 
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A tooth (upper right second molar) with a filling presumed to 
be adventitious: No, 518, female. 
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Plate 



A. I Iflilicli"* mil mI> ill.- in«< r il>nti»I in. Hi, 



V. .Denticles outside (In 1 right side nl' the 
lower dental arch. 


H. Diiisti'imir hetween nculnil incisors anil 
between lnteml incisors and eimines. 


The anomalous jaws of a female skull (No. 437). 
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A. tiuckeU (in- (la 


ii'mimiiritry (noth Miiiul ri^Iit caniml 
incisor.' Xo. 7115, juvenile. 


II. (irossly (Iclleotrrt left Cimilu 
•Vo. 'till, female. 


Palates with anomalous dentitions. 
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Plate XXII 
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Jaws with anomalous dentitions, 
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taw* with onomalous dentitions or cysts 


A ovHt in iln* anterior port of the i*Hrt 
No. 4(17, fewalo. 
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Foreword by G. E. ZoBell 

Several different methods have been described for estimating the densities of 
bacterial populations in solutions. The most popular of these are plating pro¬ 
cedures, the minimum (successive) dilution method, and various direct micro¬ 
scopic counting methods. The accuracy of the two former methods is predicated 
upon the ability of the bacteria to multiply in nutrient media. Since no one 
medium under any one set of conditions can provide for the multiplication of 
bacteria with highly diverse nutritional and environmental requirements, it is 
not surprising that direct microscopic counts on materials containing a hetero¬ 
geneous bacterial flora are usually appreciably higher than plate counts or 
dilution method counts. However, direct microscopic counts fail to differentiate 
between dead and living bacteria and are beset with almost insurmountable 
technical difficulties due to the minuteness of the bacteria. Therefore the choice 
of an enumeration procedure usually rests between plating procedures and the 
dilution method. 

Briefly, and in its simplest form, the dilution method consists of the following 
procedure: A sample (e.g. 1 c.c.) of the solution under investigation is taken and 
inoculated into sterile nutrient medium in a test tube. Part of the original solution 
is then diluted in a certain ratio (usually 10-fold) and the same size sample of this 
diluted solution is inoculated into nutrient medium in a second test tube. This 
process is repeated as many times as seems necessary; so that in the end the 
experimenter will, have prepared a series of inocula representing successive 
10-fold (or some other ratio) dilutions of the original solution. The highest dilution 
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in this .series which shows growth is Han ? thru to mribate the magnitude of the 
bacterial density >» the original w»h«fi'»n. 

Such an experiment, however, obvimndy tell* nothing, but an integral power 
of 10 which more or less accurately determines what phvsbisU call the “order 
of magnitude" of the density. To obtain greater amiraev. investigators (cf, 
Halverson & Ziegler, 11133) have suggested that two to ten nr more tubes of 
nutrient medium he inoculated with each dilution of the material. Upon the 
basis of results obtained in such multiple tube exjmriment*, several methods of 
estimating the bacteria! densities which gave rise to them 1 results have been 
suggested, as described by Mr (Jordon. 

In comparing the minimum dilution method with plating proredumr for the 
enumeration of marine bacteria, in tins laboratory, the dilution method prob¬ 
ability tables of Halvoraon & Ziegler (11(33) were used. Them* tables give esti¬ 
mates of bacterial densities based on the numbers of “positives" (i.e. showing 
growth) observed in ten inoeula of each of three wuemudve in fold dilutions, 
More dilutions than this (e.g. live or six) were of room* in most eases prepared, 
and the estimates were based on the most critical net of three of these dilutions. 

The 'results revealed that the dilution method estimates averaged about 
20% higher than plate counts on the same material. Although on some in¬ 
dividual samples they wore actually lower. Duplicate determinations by the 
two methods on the same samples of material showed that the degree of repro¬ 
ducibility of the plate counts was much higher than that for the dilution method 
counts. In view of the fact that the dilution method has applications where the 
plate count cannot lie used, further efforts were marie to increase its accuracy, 

This can be accomplished in either of two ways; namely, by inoculating more 
tubes with each dilution or by using more dilutions. The latter alternative was 
tried in which, instead of diluting each time by J0-fold, the dilutions were made 
^/10-fold, Under these conditions it seemed that more reliable and more repro¬ 
ducible counts wore obtained by the dilution method even when calculated by a 
crude arithmetical method. 

It was at this stage in the experimental work when Mr (Jordon was consulted 
to aid with the calculations. After examining the methods by which Halvoraon 
& Ziegler obtained their results, he expressed the opinion that these methods 
appeared questionable, He has developed the procedures presented in the 
following paper for determining the geometric mean estimates, as ho has described, 
It is to be hoped that his method will yield more consistent, and more reproducible 
results, A brief tesunad of the results has already been published (Gordon, 1938). 

I. Introduction 

The underlying assumption by which estimates are made of a bacterial 
population density, from the results of the successive-dilution technique, is that 
the individual bacteria are distributed in the space occupied by the fluid medium 
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containing them, in the same way that we should conceive molecules to be dis¬ 
tributed in a solution, if each molecule were unaffected by the presence of any 
other molecule. (This is not, for instance, true of molecules possessing magnetic 
or electrical polarity, or of ions,) In other words, the individual bacteria are 
assumed to be randomly distributed in their medium; so that if, for example, a 
sample of 1 c.c, is taken from 100 c.c, of solution, each bacterium in the whole 
solution enjoys an independent chance of 1 out of 100 (probability = 0-01) of 
being caught up in this sample. 

Now, in so far as the bacteria exist as individuals they undoubtedly do not 
satisfy such an assumption “to the letter”, but probably exert a certain mutual 
uniformizing influence on one another. That is, returning to the example just 
cited, if there are 10,000 bacteria in the 100 c.c. of solution, the probability is 
likely to be in reality 0, and not 0-01 10 > 000 , that all the bacteria should be caught 
up in the 1 c.c. sample. This, however, simply implies that in a series of such 1 c.c. 
samples the numbers of bacteria caught up in them are somewhat more closely 
clustered about the true average density, than we should compute them to be 
on the basis of the above assumption of complete randomness. 

In this sense our assumption of complete randomness amounts simply to 
“considering a least favourable case 11 if we are computing the probable “spreads ” 
(standard deviations) of actual counts. On the other hand, the assumption causes 
us to over-estimate somewhat the probabilities of obtaining no bacteria at all 
in our I c.c. samples, and to underestimate the probabilities of obtaining “one 
or more ” viable bacteria in the samples. It is these probabilities that enter into 
the estimate of population density by the dilution method. No possibility appears 
for rationally correcting for such discrepancies; fortunately, however, the two 
errors committed will to some extent cancel each other in the formulae which 
follow, But there is suggested in these considerations the possibility of developing 
rational means of measuring the uniformizing influences associated with various 
specific cultures in nutrient solution. The values obtained might be correlated 
with population densities and various specific properties, and might yield very 
interesting interpretations. 

Another, perhaps more serious, objection to the assumption of “random¬ 
ness ”, is the known faot that most species of bacteria show tendencies to gather 
in multicellular groups within the solution, as well as to congregate on glass walls 
and thus to go out of solution. These processes must affect all methods of counting 
equally, however, so far as the samples are taken in the same way; hence they 
should not interfere with the comparability of counts made by different methods. 

It might be mentioned here, though, that in comparing plate counts with 
dilution estimates, account should be taken of the fact that the dilution estimates 
include obligative anaerobes, while plate counts do not. 
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II, Methods m khtimatixo mrri.ATm>s vwm hehi-ltk ok 

M?rt'r,H«iviM>ii,nnt»N TKriiswK 

Assuming random distribution of individuals in a large volume of solution, 
as described above, let p represent the total number of individuals in the whole, 
solution, divided by the total volume That is. p represents the mean 

density of population in the solution, expressed in a number of individuals per 
unit volume. Then the probability that in a wimple of one unit volume (e.g. 1 c.e.) 
of solution there will 1m / individuals {where r is a. js witive integer ) is given by 

/VI* £ A (1) 

which is the Poisson distribution. 

The probability that there are ft bacteria is accordingly /’|ft) r "D anti the 
probabilityof obtaining: one ormorehnrteria in the sample {i.u, any number except 
0) is 1 -P(tt) » 1 »(. <’, since the distribution (1) is normalized. Hence if we take 
ten teat-tubes containing sterile nutrient solut ion, and inoculate each with 1 unit 
volume of our solution, then the probability that « (^lu) of these tidies will 
have received Home number of bacteria, and {10- n) will have received none, in 

uwr»)- (, “ r "* w 

by well-known rules for computing probabilities. If we understand p to represent 
mean density of triable bacteria, then GJn) in (2) represent*! likewise the prob¬ 
ability that n of the ten test-tubes so inoculated will show growth, and simul¬ 
taneously the other (10—n) of the tubes will fail to show growth. 

Let, us now inoculate three rows of ten tulxw each, the first row with samples 
of the original solution, the second row with samples of a infold dilution of the 
original solution, and the third row with samples of a 100-fold dilution of the 
original solution, Let p represent, the mean density of individuals in the middle 
(10-fold) dilution, and denote by n 10 , a, and n m the numbers of tubes showing 
growth in the three respective rows. Then the probability of observing a result 
represented by the triplicate (w 10 ,n„%,} « n is 

Qf(n l0) n lt n H ) » Gf(n) * G; 0p (» l0 )»,,(«,) 

_ (10!) 3 (e“ 1Q /') 10 ” n w (g™/>)io -a, -»», { \ „ e - (i _ e - /tpn (i - e -o-i/>y«o i 

n 10 1 1(10 - n H ) ! (10 - n,)f( 10- n^)! 

( 3 ) 

It is upon these three equations (1), (2), and (3) that several methods of estimating 
population densities from the results (n 10 ,%,%].) = n of successive dilution 
experiments are based. (These equations of course refer to successive 10-fold 
dilutions, and ten tests to each dilution. Analogous equations are easily formed 
corresponding to other dilution stages and numbers of tests,) 
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Wells & Wf’tSn s'! !<ii 11 hnvr puhlt*Itcd H method of estimating population 


density. Their applied to the present cast*, amounts simply to computing 

three vainer of p. one f«« fit each of tlm wpmfionH 


t 




m-n 

.. ui 


Itt, 


m n t. ,.«■}„ „ 

in * to ’ 




whw (n,„. n 0l )»re tin* n»mi*erwaf " positives” in the; three scries of tests. The 
geometric mean { (pwq/Vt)these three values is then accepted as an estimate 
of the true value of/*. 

On the surhi'-e of it, wnh a way of arriving at an estimate appears rather 
naive. Such direct computations are usually unsafe when the data are subject to 
large uncertain tu*#. Apparently the only reason these authors used a geometric 
mean of the three values instead of some other mean is that from the equation 


t n> at 



(5) 


we obtain 
which is linear in 


log//.«log (log j— ) ~ ] “K r ' (®) 

log ft, log r. and log (log j. so that the arithmetic means 


of these terms are related in the same way as the terms themselves, (Note that 
|[logp ir t logp, + Ing/ki! «■ lng(PipP,p frl )‘.) 

However, bacteria counts are usually expressed " to a certain number (e.g, 3) 
of significant figures"; that i«. interest is centred on the proportionate errors, 
not on absolute errors. Tlw difference hetweeji no bacteria and one is more 
important than that between Hint) anti 1001. This consideration indicates that 
the expectation of lug/>, given the data, is more like what we want than, say, the 
expectation of p. Hut it does not follow that the mean of three separate estimates 
of logp is the best estimate of log p, because the estimates arc not equally accurate; 
they should be weighted by the inverse square of the standard deviation of logy 
corresponding to an observation n r which would have to be computed from an 
"inverse’' distribution derived from (5) by use of Bayes’s formula. 

Halvomon k Ziegler (1033) published a mimeographed tract in which they 
presented tables giving the modes of (??,'(n) (sec equation (3)) together with the 
corresponding maximum values of (^\n), corresponding to various combinations 
( w iO' w i> ?( o*i) ES « (that is, considering Gf{n) as a function of (>)■ Their notation 
is not the same as that here used; it is as follows: 


p *> X w p corresponding to maximum of Gf{n), 


Gf(n) » (?$(n) - P, 


%q ~ Pl\ % = Pi' %1 “ TV 

This estimate p of p ia obviously that obtained by the “method of maximum 
likelihood", which is associated with the name of R. A. Fisher. The relation of 
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this method to the principle of invent*' probability how dm-uwied by •TeffreyH 
( 19380 , 6 ). It is equivalent t« taking » uniform prior probability for p, mid then 
adopting the ramie of the posterior probability a* the estimate 

In the present instance the significance of a modal estimate * • i /*« reduced hy 
the fact that the a pmimtm distribution* *4' p (computed m parti* nlar from 
uniform a priori probability) vary widely us «l»*wnm« with different values of 
the n r '». This ia indicated hy the eomparwoii *4 thm> >•nmputed geometric menus 
with corresponding modal iwtimaUwi in the S«.«t wwtioii of thi* j«ij**r. The point 
is verified by a aeries of laboratory pompariwm* «t modal tllalvorMin h Ziegler) 
estimates with direct plate count* on the juitne materials, reported elsewhere 
(Gordon & ZoBell, 1939). Thus the relationship of the mode to the distribution 
of which it is a reptwntative ix a varying quality, and the mode is deprived of 
any immediate significance. 

It has already been mentioned that the particular value of p thnl is best 
adapted to the purposes of biuteriuhigy eormq««ndH to the «*xjw‘vtnthm of the 
logarithm, on the ground that we are more interested in the ratio of p in two 
samples than in the absolute dilfercnce Further, since p eats have values from 
0 to co, but log p from -as to * *, the probability of log p ho* an opportunity of 
being nearly normally distributed that i* denied to /n similarly for correlat ion 
tanh 'Vh and for the ratio of two standard deviation* lug (■*, have nearly 
normal probability distributions, ami have been extensively used for that reason 
on Fisher’s recommendation. If we mm find the ex{welatio» of log p and the 
second moment of its probability distribution, a normal curve with corre¬ 
sponding mean and standard deviation should give a good representation of the 
law as a whole, 


III. Tub estimation* of im p 

Using the Bayes-Laplace formula with uniform (trior probability for p, we 
obtain from (3) for the posterior probability distribution of p. 


P(P) 


The expectation of log p is 


HP Pip)dp. 


(?) 


which, cancelling factors in (3) and (7), equals 
/*00 

log pe~ (m-10tt »-n (l _ e -lO/>pt „ (( _ dp 

log P ---—--- 

e -au~10n,»-n l -0-In 8 . l )p ^ _ e --10p)ni, (( _ e -p)n, ~ tr ^py^xdp 
J Q 


(8) 
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This integrands cun obviously bo expressed in the form 

| log p or 1 i (1 - E m ) n «(1 - (l - E) n «•! e - iVw>, 

where E in the Boolean operator -• (1+/J), A meaning difference with regard to 

•i' 0 - 1110— lt)()tt 10 — lOn^-n^. 

Since E is commutative with the sign of integration, we may therefore write 

(1 — (l - E 10 yn (l - Jg?)«o-i log pa ~ iV*o P dp 

log/i = ....^-4--. 


(i _ Jjmyi u (! _ E «•)», (i _ £)«« e - fop dp 

Jo 


(9) 


That this is ho may be seen by expanding the operator 

H » (i -7^ioo)»i t . (i _/gio)ji, (i_ flyin (10) 

as a, polynomial in E , then operating term by term on the integrands. The result 
will he the same as the result of expanding the integrands of (8) in powers of e. 
The integrals in (!i) may ho determined by means of the following identities; 






l^l(F(u+ l)-log*), 


( 11 ) 


where F[u + 1) is the "digamma function ’’defined by F(u+ 1) = dlogr(w+l)/dw. 
(see British Ammatim Tables, vol. 1 (1931) with a different notation). 

Place a ~ 0 in these identities; there results 


e "■ dp 


10 


J (1 log (> e “ hw dp = 1(, g ij) > 

where /-(l)« -0-57722. 

In this way we obtain simply from (9), for log p 

flog as 0 


( 12 ) 


H' 


log/3 = f(l)+log 10- 


*0 


H 


(13) 


where H is the operator defined in (10). 
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IV. Tub HTANiutm heviatiok of lag p 

The standard deviation of log/) is the square root of the second moment of 
log p about its mean log//. Using the ojieratnr--notation of the previous section, 
with standard formulae of statistics, we obtain 


(Ti 


//j ' (Iug/)) 2 e hvdp^ 


Sis i > 


H 


Now, as before, 

) 

(log/)) 2 )! " h a/' dp sa 


t * r»¥ dp 


' tl’i 1**1 

u"v h/ttP dp 

dir „ ), 


(H) 




to 


whence we easily find 


I"K i> 


i {^( 1 )}* + ^( 1) + (log 10} 4 + 2F[ t) log 10 
o 

• AH 1 { > + ?{ 1 )/h)g /„ + {log ,r Q } 2 ], 


• (log/?)* +(/( 1) + log ld| a + F(l) 


(16) 


■W)+logic] 


It?**’ 

1 a-fl 


II 


II 


+ * 


f(l»K^o) g \ 
% ! 


B 


(16) 


where 


f{ 1)= -0-577216, F{ 1)« 1'844034. 


V. TRANSFORMATIONS 

In the fractional terms in (til) and (16), the numerators and denominators 
each represent finite alternating series; but these series are useless for computing 
purposes, because the terms are so nearly equal that each would have to be 
expressed with at least twenty significant figures to yield sufficiently accurate 
results. Hence we carry out tire following transformations: 

Referring to (10) we take out (1 - E) ln - (-rip 1 as a factor from the operator 
H, and obtain 

H = HcM) N , (1?) 

where N = Sn = %+«!+%!, and 

H 0 = {l + E+,.. + E i9 )^(l + E+.,. + E a ya. 


( 18 ) 
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If this last expression is expanded, evidently all terms mast turn out to have 
positive coefficients; so that if the functions 



can be accurately determined, the required end result is simply a weighted sum of 
terms of like sign, and there is no cancellation. 

In the first place it is very simple to obtain 



JV!(*-l)l 

(x+N)\ 



by mathematical induction. 

To obtain (~d) N (logxjx) it suffices to expand log (x+u)f(x+v) as a power 
series in u and operate with (-d ) lV termwise on successive powers of v = 0, using 
the formula 


A N W = 


P ' r(-n) . 

(y-N)l P ~ N ’ 


( 20 ) 


where B { ~]§ are the “generalized Bernoulli numbers” of order (-N) and degree 
(p -N) (cf. L. M. Milne-Thomson, 1933, p. 134). By this means we finally obtain 


/■ niV/|iv. lo S^_ -^Ky—!) ! 
1 ' x ~ (x + N)\ 


logx 


“ (_1)P-W+1 p\ 

(p^N)\ 



( 21 ) 


By similarly treating log(x + N ~o)/(x+N ~rv) it is easy to obtain also 


[-If A* 


log x__ N\(%- 1)! 
x (x + N )! 


log (x+N) 


p\ 




(i x+N )»+! (p-N)\ 


1+- + ... + ~j^~ J zv- (21 a) 


These formulae (21) and (21a) can. evidently both be used simultaneously to 
obtain upper and lower bounds to the required result. Both are convergent if 
x>N. 


The sums 


, 1 1 1 
1 + 2 + 3 + ‘"+p 


in (21) and (21a) can be computed directly, or by means of various short formulae 
of which the following may be suggested (Boole, 1872, p. 92): 


1 + J + ... + - = 0-577216+ lGgp + i-~ 

2 p 2 p 


1 

12 / + 


1 

120 /'"' 


(22) 
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The Bernoulli numbers are easily computed by means of the reversion formula 
(Milne-Thomson, 1933, p. 129), 




vB&P+ Bfr» + » 


(23) 


1 + 


N 


together with the values Bf ] = 0 if u^O; B { 0 ~ N} = 1. Also polynomial formulae 
are given in Davis’s tables. We have computed them as far as N - 27, v — 7, 
of which a tabulation is presented at the end of this paper. 

((log*) 2 ) 


To deal with the function A N 


we make use of Boole’s operator identity 

= {<&**- i)N (24) 

If this is expanded in powers of d/dx and made to operate on (log :e) 2 /.'c, the result 
is a series in the derivatives of the latter function. These derivatives are found 
to have the form 


d v (logs;) 2 _ (- l) iJ p!(loga;) 2 p 


dx p x 

and the final result is 

00 


rPl 1 


{2 log x + (p — 1) 


—£_:—{(-!)?) (j 0 ir x f -p 2 ] °£ xB & -p lv: 

(p-N)\ x p +i\ [ l (p_ 2 )! 


Pi 


s 

® P=N 

In an analogous manner we also obtain 


(25) 


A N 


(log®) 2 


<-i)*E 


p! 


TH-N) 

n p~N 


[log(x + W)] 2 


p=n {p-N)\ (x+N ) p+1 1 
| (- lp 2 log (a+IV) B lp Ap t (-1 


(p-i)! 1 (p-2)!". r (25a) 

As (21) and (21a), so also (25) and (25a), can always serve together to give upper 
and lower bounds to the values sought, so long as x 0 > N, 

For use in the last equations, the following formulae may be found useful: 


pfo+P /ll 1\ 

(P-1)! 1 ' \ + 2 + 3 p)' 

Ult-i p/^V !- 1 )*- 1 

(P-2)! 1 ixrir a * 


(26) 

(27) 


(i) Expansion of the operator H 0 

The operator H 0 (equation (18)) taken as a polynomial in E ; has the form of 

|> 0 (Z) = (l+«+...+Z 9t, )«io(l + Z+...+29)% 

(1—z) m i»+ 7l i 

= P 0 + P x z + P 2 z 2 + ... + P„Z" + ... + P 09nio+9ni Z««"ic+9%, 
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where P v = P 99)tl0+9ni _ 


with 


■Oil j -In— 10 i- 

s s 

r=0 s—0 


„=£ £ (-ir +s ^c^Oi;i 1QQ) ._ 1Qs 


6 ! 


100 r- 10 s +1 


. ( 2 8) 


nb _ 

“ o!(6-o)!‘ 


Thus, with these values of the P u ’s, we have 

H 0 = P 0 + P 1 E+P t E»+...+P,BP+.... 


(29) 


(ii) Computation of P u for large v 

When v is fairly large, that is, considerably different from both 0 and 
99% 0 + 9% + l, the formula (28) becomes unwieldy and inaccurate. By means 
of a Laplace transformation, that is, by making use of the so-called “character¬ 
istic function” for P„ (see Uspensky, 1937, chapter xil), we are able to obtain the 
following Fourier series representing the continuous function which is obtained 
by plotting the points (u, P u ) and then connecting them with straight lines: 


where 


and 


P u = Pitau+m,-, = £ A oos^f(u-b), 
/»=o -o- 

Ii = 99% 0 + 9% + 2; 6 = |(99% 0 +9%); 


(30) 


An ~ 


1 0 2w io+>il 


99?ijLQ + 9n^ + 2 


A -JL 

< l 8nY 


/ . IOOttiA" 10 / 

\ n 7T~) ( 


. 10nu\ 

Bm ~jr) 


*h 




for p=£ 0. 

If /ijK is an integer, + 0, then 


(^f) 


Otherwise obviously 


Q/ll0+7?-l 

^ = 8^V 0"M+%-* = °' 


!^l<- 


sm 


n/a 

~K 


8tiV 2 

14 !<-*_ 

1 /‘ 1 87T 2 /t 2 


StP/U 

] Q 2 tIio+ 71 i / 77- \ 2 .1 

< toy (“"ioo) “ !/*/■£-»I 

10 ,l iZ / 


/ . 7T \-"io+ 2 1 1 

( sin 2 oo) 

I . 7T \—"I" - »i+2 1 

\ Sm 20) ’ IP/^-^1^20- 


The first limit is the greatest. It follows that 

“ , 2n/i,r , . 0-000247 x 10 2 ™i»+ )l iZ " 1 

2 A coa~f(^b)< £ \Ap\<- -r-j- £ Ta 

0-000247 x 10 2 "io+»iZ f°°da: 0-000247 x 10 2,l io+%i£ 


8tt 2 




8n 2 p 


(31) 


which can serve as an error function for (30). 
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VI, Remarks 

Sufficient mathematical tools are presented in equations (19), (21), (21a), 
(25), (25a), (28), and (30), for actually computing the geometric mean estimates 
p of densities of bacteria from the results of successive dilution experiments, 
together with the standard deviations of their natural logarithms, from formulae 
(13) and (16). The standard deviation of the logarithm of course will be a measure 
of the proportionate accuracy of the estimate, in the same sense that the ordinary 
standard deviation is a measure of the arithmetical accuracy, (It may be remarked, 
incidentally, that all logarithms occurring in the above formulae are understood to 
be natural logarithms.) 

If we assume in general that the function P(p) defined in (7) can be very nearly 
approximated by a normal distribution function with respect to logp which 
should be satisfactory for bacteriological purposes—tben corresponding to 
o" logp there will be a “probable error” 0-675cr logp = s. To this will correspond a 
“probable error ratio” 

e = e 8 -l = 10°' 293o '-l, 
that is log 10 (1+e) = 0-293tr logp . 

The meaning of e is that the probability is approximately that 

“I ~P<P~P<^ 

To save labour, it remains yet to determine limits to the extents of errors 
committed by adding only, say, every twenty-fifth term corresponding to the 
operator H 0 = ^P U E U , 

(»> 

in making the computations; or else to determine short formulae for the required 
sums, For the purposes of rigour, upper bounds will also have to be determined 
for the Bernoulli numbers used in the formulae. 

Mrs Naomi Lancaster has made rough computations of several values of p 
as shown below: 


Argument 

P 

computed 
by us 

5 from 
Halvorson 
& Ziegler 

% deviation 
from Halvorson 
& Ziegler 

»10 

n 


10 

7 



■Iff 

—7-0% 

8 

5 


.■fill 


+ 9-0% 

4 

2 

1 

. 

0-080 

+7-5% 


If we were to plot these percentage deviations against n w> we should be led to 
expect a maximum positive deviation of perhaps 12 % or more in the region of 
= 6 or 7. 
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Dr ZoBell has compared plate counts with corresponding estimates from 
Halvorson & Ziegler’s tables, and their ratios show exactly similar trends to 
those indicated by the above three comparisons, but relatively even more pro¬ 
nounced. These will probably be reported elsewhere in detail. 

In order to make possible the practical use of these results by bacteriologists 
and others, as well as to test their validity experimentally, it will be necessary to 
prepare tables of p and e as formulated in the preceding pages, corresponding to 
the tables previously prepared by Halvorson & Ziegler. This will require financial 
aid from some source. 

The writer wishes to express his appreciation to Dr 0. E. ZoBell for acquainting 
him with this very interesting problem, and to Dr George E. McEwen for his 
encouragement and occasional aid in carrying out the work and making available 
the necessary literature. Dr Harry Bateman of the Mathematics Department, 
California Institute of Technology, and Dr Risselman of the University of 
California at Los Angeles, were also helpful in bringing the analysis through one 
or two difficult points. 


APPENDIX 

Table of Bernoulli numbers B[~ N) 


V = 

1 

2 

3 

4 

5 

6 

7 

N= 1 

1 

1 

1 

1 

1 

1 

1 

2 

3 

4 

5 

6 

7 

8 


1 

7 

3 

31 


J27 

85 

2 

6 

2 

15 

3 

"If 

12 


3 

5 

9 

43 

69 

3025 

311 

3 

2 

2 

2 

5 

4 

84 

4 



13 

10 

243 

185 

• 6821 

1325 

4 

2 

3 

10 

3 

42 

3 


5 

20 

75 

331 

675 

11215 

5225 

5 

,2 

3 

4 

6 

4 

21 

3 

6 


19 

63 

1087 

777 

30083 

5432 


2 

2 

10 

2 

21 


7 

77 

49 

1939 

4753 

9992 

43120 

7 

2 

6 

10 

6 

3 

3 

8 

4 

50 

72 

4819 

1476 

146240 

33664 


3 


15 

21 


9 

9 

21 

405 

2614 

2565 

93886 

573975 





8 

2 

4 

6 

7 

10 


155 

275 

762 

12650 

677465 

667325 

5 

6 

2 

3 

28 

4 

11 

11 

187 

363 

16258 

26499 

1158509 

3164029 

2 

6 

2 

15 

4 

28 

12 


12-2 
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Table of Bernoulli numbers B[ N) (cont.) 



1 

2 

3 

4 

5 

6 

7 

N = 12 

C 

37 

234 

15149 

10 

10023 

2842373 

42 

466725 

13 

13 

130 

1183 

20631 

176267 

2238730 

9453353 

2 

3 

4 

10 

12 

21 

12 

14 


301 

735 

82439 

41895 

975623 

7704026 

7 

6 

2 


2 

6 

6 

15 

15 

2 

115 

2 

450 

7181 

2 

29175 

10129615 

42 

2026550 

16 

8 

196 

544 



7330868 

9329056 

3 

5 

3 

21 

3 

17 

17 

221 

2601 

87601 

106641 

10384229 

37236205 


3 

4 

15 

2 

21 

8 

18 


165 

1639 

36483 

70281 

19239635 

27257571 

mm 


5 

28 

4 

19 

19 



45049 



39123375 

... 

6 

2 

— 

12 

28 

4 

20 

10 

306 

1060 

66049 

117076 

17672415 

41371225 



3 

6 


14 

3 

21 

21 

112 

4851 

133217 

7714707 

5022439 

38264149 

2 

4 

10 

62 

3 

2 

22 

11 

737 

2783 

159819 

1115983 


78466685 

6 

2 

10 

6 

21 

3 

23 

23 

805' 

1587 

570653 

461817 

59587135 



2 

6 


2 

21 

3 

24 

12 

146 


112379 

6 


76306988 

21 


25 

25 

475 

8125 

26380 


96763700 

1486135625 


2 

3 

4 

3 

21 

24 

26 

13 

1027 

6 

4563 

2 

461578 

15 

419796 

486388487 

84 

322887513 

4 

27 

27 

369 

5103 

178462 

2019087 

202034577 

416849895 

2 

■a 

mm 

yH) 

4 

28 

4 
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NOTE ON THE INVERSE AND DIRECT METHODS OF 
ESTIMATION IN R. D. GORDON’S PROBLEM 

By E. S. PEARSON 

The practical solution of the problem of estimating the mean density of a bacterial 
population by the dilution method requires not only the determination of 
a single-valued estimate, but also some measure of the reliability of this estimate, 
By the introduction of an ingenious mathematical procedure Mr Gordon has 
taken, in the preceding paper, a first step in the solution of this problem on 
lines involving the application of Bayes’s theorem.* Thus if p is the mean 
density per unit volume at a given dilution, he obtains an a posteriori distribution 
for p and, since it seems likely that the derived distribution of logp, rather than 
that of p, will be approximately normal, he shows how the expectation and 
standard deviation of log p may be calculated. With the help of these a probability 
statement regarding the unknown p of the form given on p. 178 may be made. 
To put the results into working form extensive computation will, however, be 
necessary. 

Contrasted with this inverse approach is what may be termed the direct 
solution involving the determination of a fiducial or confidence interval. This 
solution requires (a) the choice of an appropriate sample estimate of p, say R, and 
(6) the determination of its sampling distribution, say p(R j p). Mr Gordon refers 
to R. A. Fisher’s maximum likelihood estimate, say li L (Fisher, 1922, pp. 363-5), 
whose calculation in the case where ten tubes are examined at each of three 
dilution levels, the dilution factor being 10: 1, has been made easy by the tables 
of Halvorson & Ziegler (1933 a), but convinced as he clearly is that the inverse 
approach is the only legitimate one, he no doubt did not feel it necessary to 
discuss the possibility of a fiducial solution as alternative to his own. Since, 
however, a paper by Matuszewski et al. (1935) did present a preliminary working 
solution of this kind for just the same experimental arrangement—I will call it 
the 10, 3, 10 arrangement—discussed by Halvorson & Ziegler and by Gordon, 
it may be useful to make some reference to this result, and also to add a few 
comments on the difference between the two lines of approach. 

In their first paper (1933a, p.121) Halvorson & Ziegler gave a table showing the 
value of R l corresponding to each (or rather, most) of the possible combinations 
of w 10 , n v and % 1 , the number of tubes (in Gordon’s notation) out of the ten 
tested at each dilution which show growth. In their third paper (1933c) they 
carried out some investigations into the sampling variation of R L for a fixed p, 

* It is interesting to note that the pioneer paper on this subject by Greenwood & Yule (1917) 
also followed the Bayes’s theorem approach. 
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Taking the 10, 3, 10 arrangement and the four eases p = 0-15, 0-25, 0-50 and 
1'50, they calculated from the term of an appropriate multinomial expansion 
the probability of observing different combinations of n 10 , n v and n 0 . t ; hence, 
using their tables, they obtained the probability associated with different values 
ofjfo. Their results, presented in the form of tables and a diagram, show that: 

(1) The distributions of R L are asymmetrical. 

(2) The distributions of log-ftj, are more nearly symmetrical, but since Ji L 
can only assume a finite number of discrete values, the distributions cannot in 
either form be represented adequately by smooth curves. 

(3) Calculations from the partially grouped values give the results in Table I. 


TABLE I 


p 

0-150 

0-250 

0-500 

1-500 

Mean (R L ) 

0164 

0-284 

0-558 

1-648 

vGh) 

0066 

0-117 

0-226 

0-689 

log P 

- 0-824 

-0-602 

-0-301 

+ 0-176 

Mean (log R L ) 

-0-816 

-0-578 

-0-285 

+0-184 

cflogjy 

0-163 

0-164 

0-163 

0-168 

<r(logRi) (limiting 
| formula) 

0-1535 

0-1532 

0-1768 

0-1550 


N.B. Logarithms are to base 10. 


(4) The values of cr(log R L ) calculated from Halvorson & Ziegler’s probability 
distribution remain nearly constant in the range of p considered. Having regard 
to the respective standard deviations, the bias of loglt^ is of less importance 
than the bias of R L . However, if a method of obtaining accurate fiducial limits 
were available, the bias in the single-valued estimate would be of no im¬ 
portance. 

(5) The figures in the last row have been calculated from Fisher’s formula 
for the large sample value of the variance of a maximum likelihood estimate, 
which reduces in this case to (Fisher, 1922, p. 364): 


Mlog c R l )}~ 1 = 10 


g-10/i 

1-e- 10 * 


100pH- 


e~P 


■ e~P 


pH 


e ~l\p p2\ 

100 ; ‘ 
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This formula gives minima for cr(log R L ) in the neighbourhood of p = 0-16, 1*6 
and 16. A series of calculated values including those tabled above are: 


TABLE II 


p 


0-15 

0-26 





2-00 






0163 




0*166 

0-166 

0-163 



0-174 


The probability values tabled by Halvorson & Ziegler are not sufficiently detailed 
to determine how far the differences between the values of cr(\ogR L ) compared 
in the last two rows of Table I are due to inadequacy in the large-sample variance 
formula. It is, however, clear that the standard error of the logarithm of the 
maximum likelihood estimate changes very little with p. Halvorson & Ziegler 
reached a similar result by noting that (t(R l )Ip was very stable. The same result 
was noticeable in a series of calculations concerned with estimating the density 
of organisms in milk, made by Barkworth & Irwin (1938). These authors analysed 
the results of seven separate experiments in which 255 tubes were tested at each 
of four dilutions, namely 1: 10, 1: 50, 1: 250, and 1:1250. They obtained the 
standard error of the maximum likelihood estimate of p from a large sample 
formula analogous to -that given above; for seven values of p lying between 
19 and 67, the ratio cr(R L )lp lay between 0-058 and 0-062. 

It is clear that if more detailed computations of the probability distribution 
of R l for a wider range of values of p were made on these lines, charts or tables 
giving fiducial or confidence limits for p could be readily supplied. In their paper 
of 1935 referred to above Matuszewski dal. have provided one such chart formed, 
as will be described below, on a basis which is partly empirical. The chart may be 
used as follows: 

(a) Having observed experimentally the three numbers n 10 , n v and n 0 . v 
obtain from Halvorson & Ziegler’s tables the maximum likelihood estimate, 
R l , of p. 

(b) Taking R Ll read off from the chart lower and upper confidence limits, 
say Pl (R L ) and Pi(R L ).* 

(c) Then, using Neyman’s terminology, the statement 

Pi{ r l)<P<Pz( r l) 

may be associated with a confidence coefficient of 0-95. In other words, if this 
procedure is applied in general bacteriological practice to the three frequencies 
7i 10 , n v and w 0ll obtained from the dilution method, then the odds will be at 

* The authors take A' as the maximum likelihood estimate of mean density A in the most 
oonoentrated of the three solutions, so that A'=10 Rl. 
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least* 19: 1 that the interval Pl (R L ), p 2 (R L ) will cover the unknown mean 
density p. 

It should be emphasized that the chart provided by Matuszewski et al. was 
not based on an exact mathematical solution, but found by graduating a series 
of experimental sampling results; the following quotation from their paper 
(p. 76) explains the method employed; 

The method followed by Miss J. Supihska consisted in a complex sampling experiment, 
using Tippett’s random sampling numbers, The experiment produced a series of values 
of the variates aq and [i.o. fi 10 , n x , and in Gordon’s notation] following the 
sampling distribution which they would follow in our hypothetical conditions of the 
experiment, For each series of x 1 and x s it was possible to read up from the table of 
Halvorson & Ziegler an estimate, say A.', of the concentration A. The estimates A' have 
been then tabulated and an empirical frequency distribution of A' corresponding to several 
fixed values of A has been determined. Following the method described by J. Neyman, 
these empirical frequency distributions were then used to construct confidence intervals 
as if they were the accurate ones. As the random variation could not fail to affect the limits 
of the intervals it was felt necessary to correct them by fitting two parabolae, one marking 
the lower and the other the upper limits of the confidence intervals. 

While therefore the writers did not claim exactness for their results, their chart, 
combined with Halvorson & Ziegler’s table, does provide a provisional working 
solution not yet available to those who prefer the inverse approach. 

It is interesting to obtain from the chart the limits Pl (R L ) and Pi (R L ) in the 
three hypothetical cases discussed by Gordon on p. 178 above. In Table III these 
limits and also their logarithms are given. 


TABLE III 


Observed 

frequencies 

Gordon’s 

estimate, 

P 

Maximum 

likelihood 

estimate, 

Ri 

log Ri 

_ 

95 % confidence limitB 

_._ 

»to 


*0-1 

Pi 

P 2 

kg Pi 

_ 

log/b 

4 kg/v> 2 

10 

7 

3 

143 

1*63 

■j 



HH 

0477 

1 

8 

6 

1 


0*267 

Efim 






4 

2 

1 

HII 

0'080 

■ 



— 1-64 

HI 

Hii 


It will be noted that; 

(1) The 95 % confidence interval for p is in all cases relatively broad and, 
having regard to this, the differences between the single valued estimates p and 
R l are of little importance. 

(2) While neither p nor R L is central with regard to the interval, log R L 

* 095 is a lower limit to the probability of a correct statement, owing to the discontinuous 
distribution of the frequencies. 
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differs only slightly from |(logp 1 +logp 2 ), as will be seen by comparing the 6th 
and last columns. The length of the interval \ogp 1 ~\ogp 2 changes slowly, 
increasing as R L decreases. A study of the chart shows that for R L roughly in the 
range 0-6-2-4, the breadth of the interval logp a —]ogp x remains nearly constant 
at a value of about 0-55, increasing slowly as R L drops below 0-6 or rises above 
2-4. Without further details of Supitiska’s sampling experiment, it is not possible 
to make a closer comparison between this confidence chart based on random 
sampling and the four distributions of R L tabled by Halvorson & Ziegler. Both 
results suggest, however, that for a considerable range of values of p, log 10 iij, 
is distributed about a mean value of approximately log 10 p with a standard error 
of about 0-16. 

If finally it be asked whether the direct or inverse solution is to be preferred, 
the answer must, I think, be that this can only be a matter of personal opinion. 
As mentioned in the footnote on p. 181 Greenwood & Yule (1917) preferred the 
latter method. The fundamental difference of the two methods of approach has 
recently been emphasized by Harold Jeffreys, with whom it is to be supposed 
that Gordon is in substantial agreement. Writing with regard to the a priori 
distribution of an unknown parameter, such as p, Jeffreys says (1938, p. 466): 

I can find nothing hi the works of the pioneers of the principle of inverse probability 
to suggest that they identified the prior .probability with a known frequency, and believe 
that if such an idea had occurred to them they would have repudiated it as definitely as 
I do. The function of a prior probability used to express ignorance is simply to express 
formally the transition from an inference about different possible data, given the hypo¬ 
thesis, to one about different hypotheses given the same data, and this transition must 
be made somehow on any theory. 

It follows that if the prior probability distribution cannot he identified with 
frequency, neither can the posterior distribution. A probability, in Jeffreys’ sense, 
obtained from the integral of the posterior distribution between limits p 1 and p 2 , 
can he regarded as no more than a rational measure of the degree of belief that the 
experimenter may place in the truth of the statement p 1 <p< p 2 . It can have no 
precise link with long-run relative frequency. This, indeed, it could only have 
if there were reason to suppose that in repeated dilution experiments the popula¬ 
tion value of p would be distributed uniformly between 0 and oo.* But the 
legitimacy of any attempt to make this connexion Jeffreys has denied emphatic¬ 
ally. Tor him, if I understand rightly, the posterior distribution derived from 
the formal prior distribution provides the one type of numerical sealing which 
he regards as useful in forming an opinion on the value of an unknown constant. 
To quarrel with this conviction would be out of place. 

On the other hand, those who agree with Jeffreys must recognize that the 
direct approach with its fiducial argument has resulted from the development 

* Gordon has assumed this to be the most appropriate form’of prior distribution for p; possibly 
Jeffreys would prefer to make the distribution vary as 1/p. 
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of a line of thought which finds probability statements useful in a form in which 
they can be directly related to long-run frequency of occurrence. To know that if 
a certain experimental technique is carried out and an arithmetical calculation 
made, then there are strong grounds for believing* that about 19 times out of 20 
the limits p x {R L ) and p 2 {R L ) will include the unknown p is a form of information 
which appeals to a large number of statisticians. In this and other problems it 
seems likely to appeal particularly to persons who are frequently repeating the 
same form of operation, and can therefore the more readily appreciate the con¬ 
sequences of “being wrong” once in 10, once in 20, or once in 100 times, according 
to the risk they choose to allow. It is true that in certain instances there may be 
other special information which will enable the experimenter to guess at narrower 
or modified limits, rather than those obtained from the standard fiducial pro cedure. 
But whether this information is ever of a kind which can be put into numerical form 
is doubtful, Certainly this would not be achieved by using the prior probability 
distribution proportional to dp or dpjp. The fact that we can sometimes narrow 
the range of uncertainty and so get nearer the mark, does not detract from 
the long-run “safeness” of the fiducial argument. Of course, whatever the 
approach, it is essential that the sampling should have been random. 

From the point of view of the bacteriologist this difference of opinion between 
experts may be discouraging, but it has been shown more than once that the two 
lines of approach lead to results which, from the practical point of view, are almost 
precisely the same. Thus Jeffreys (1937), starting with a prior probability law 
for o' of dcr/o-, has reached “Student’s” distribution for the posterior probability 
law for the mean; it follows that in this instance the fiducial limits of the direct 
approach associated with a confidence coefficient of, say, 0-95, will correspond 
exactly with those obtained from Jeffreys’ posterior distribution and associated 
with this probability measure of 0-95. It is to be hoped, therefore, that if Mr 
Gordon obtains financial support for the lengthy computation required to produce 
tables of p and e, he will at the same time make some research into the corre¬ 
spondence of his limits for p with those following from the direct approach. 
By this means he would undoubtedly widen the range of persons who could use 
his tables with confidence. 
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ON THE DISTRIBUTION OR MAXIMUM LIKELIHOOD 

ESTIMATES 


By B. L. WELCH 

The properties of maximum likelihood estimates have been discussed by E. Y. 
Edgeworth (1908), R. A. Eisher (1922) and others. Most important is the property 
that in large samples a maximum likelihood estimate tends to he normally 
distributed with a variance which is given by a very simple formula and that no 
other estimate can have a smaller variance. If we have a random sample of n 
from a population whose probability law, p[x [ 6), depends on only one parameter 
6, and if T is the maximum likelihood estimate of 8, then under certain conditions 
it may be shown that T is in large samples normally distributed about 6 with 
variance 1 jnA e , where 



Similar formulae are available for the variances and covariances of estimates 
when there are several parameters, but only the single parameter case will be 
discussed here, 

When dealing with small samples the maximum likelihood estimate T is 
frequently adopted together with the method of approximating its distribution 
by referring it to a normal curve with mean 6 and variance 1 jnA e . The question 
arises whether this is an adequate procedure. This question splits into two parts. 
First, we may ask how far we are likely to go wrong by assuming that T is 
actually distributed in the manner known to be correct in large samples: and 
secondly, we may ask how far the advantage which T holds over any other 
estimate in the matter of sensitivity in large samples is retained in small samples. 
The first of these problems has an interesting historical aspect. Karl Pearson 
(1936) has told us that because of his doubt of the adequacy of the approximation 
in finite samples he made no subsequent use of the above large sample variance 
formula originally given by him and L. N. G. Filon (1898). However, the real 
reason for his view that the approximation is not good perhaps lay partly in the 
fact that he had not noted that the formula referred only to maximum likelihood 
estimates—a point not made clear until Edgeworth returned to the problem in 
1908. Indeed this question of how good the approximation is cannot be answered 
very definitely, for there is, as a rule, no general agreement as to how close an 
approximation should be before it can be termed good. 

The second problem has been discussed extensively by R. A. Fisher (e.g. 
1925), who concludes that the maximum likelihood estimate still retains in 
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small samples desirable properties which recommend its use in preference to 
other estimates. 

The present note is concerned only with the first problem and, in connexion 
with this, it is not intended to imply that the approximation by the large sample 
formula is not good but rather to give a method which may supply a closer 
approximation if this be thought desirable. 

Let us write p{x 1 0) equal to exp [<f>(x, d )] so that the maximum likelihood 
estimate T is given by 

Z<f>'(x, T) = 0. .(2)* 

In general, the solution of this equation is not algebraically simple, although 
the numerical solution by iteration may not be difficult. It is therefore not 
possible to find the distribution of T , and indeed it is generally impracticable 
to find even the moments of -T. However, usually the actual distribution of T 
is not required, but simply a method of calculating the probability that T shall 
exceed any specified value T 0 (say). It is often possible to find the moments 
of another quantity which will facilitate such a calculation. 

If the sample be represented by a point in n-dimensional space, then all 
samples yielding the same value T 0 of T will lie on the hypersurface Z(j>'(x, T 0 )~ 0. 
Conversely, with most probability laws likely to be encountered, and certainly 
for that of the example given below, it will not be possible for this hypersurface 
to contain points yielding a maximum likelihood estimate different from T 0 . 
On one side of the hypersurface we may therefore expect T>T Q and on the other 
side T < T 0 . Now since by Taylor’s theorem 

Z<h'(x,T 0 ) = Z<f>'(x, T) + (T 0 -T)Zf(x > T) + etc„ .(3) 

and since by definition of T we have Z(/>'(x, T) ~ 0 and Z<j>"(x, T) negative, we 
shall expect T > T 0 on that side of the hypersurface for which Z<j>'(x, T 0 ) > 0. 
Hence if the maximum likelihood equation establishes a many-one relationship 
between sample points and values of T, we see that the probability that T>T 0 
is equal to the probability that Z(j)'(x > T 0 )>Q. But the quantity Zf>'{x, T 0 ) is 
the simple sum of a number of independent components, and its cumulants 
therefore follow simply from those of a single <j>'{x, T 0 ), Any of the usual methods 
of approximating to probabilities from cumulants may then be employed to 
evaluate P{Z<j)'(x, T 0 ) > 0} and hence P(T > T 0 ). 

As an example consider the probability law 

P(x\0) = ~e-W\ .( 4 ). 

which has been discussed at some length by Edgeworth (1908). We have 
6) = 4(x-6f and therefore 

Zfi(x,T 0 ) = iZ(x~T 0 f. .(5) 

* The dash denotes differentiation with respect to 0, and £ denotes summation over the whole 
sample. 
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Now by a straightforward method it may be shown that the first three cumulants 
of 4(® - T 0 ) a , when the true value of the parameter is 0, are given by 


where 


Kl ^ M ^U c{ e-T 0 ) + i(6-T 0 )\ 

*2 = /i a = 12c + (60 — 144c 2 ) (0 — P 0 ) 2 + 144c(0 - T^, 
K s = / h = 36(0 - T 0 ) [(5 - 12c 2 ) + (48c + 96c 3 ) (0 - P 0 ) 2 

+ (36 - 144c 2 ) (0 - To) 4 ], 

c = ^jfj = 0-33799. 


The cumulants of Z(f>'(x, T 0 )jn are then k v tc 2 jn and /c 3 /» 2 . By fitting a Pearson 
Type III curve with these three moments we can then approximate to 
. P{42'(a:-T 0 ) 3 /w>0}. If closer approximations are necessary further moments 
may be calculated and some other kind of curve fitted. 

More frequently, perhaps, the converse problem is posed of finding T Q such 
that P(T>T 0 ) has a specified probability. The above method then involves 
a certain amount of trial and error. For instance, suppose n - 25 and T n is 
wanted so that P(T>T 0 ) equals 0-05. A first approximation is obtained by 
taking T to he normally distributed about 0 with variance 1/25 A e , where 


I 


a, =J{ nx-effpb = 


4-056. 


( 7 ) 


The approximate standard deviation of T is then 0-0993 and T {) is 0 + 0-163. 
Substituting this value of T 0 in (6) we obtain /c x = -0-678, k 2 /25 = 0-210, and 
K z j 625 = -0-0392. The standard deviation of 4A7(m — (9 — 0*163) 3 /25 is therefore 
exactly 0-458, and the skewness - -0-407. Using these moments and the 
very convenient tables of L. R. Salvosa (1930) we find that approximation by 
a Pearson Type III curve gives P{ i£(x- 0- 0-163) 3 /25 > 0} = 0-056. From this 
it is not difficult to see that, by adjusting T 0 to the value 0+0-168, the Type III 
approximation to S<j>'(x,T 0 )ln will give P(42’(a;-0-OT68) 3 /25> 0} = 0-05, i.e. 
P(T> 0 + 0-168) = 0-05. 

There is in the present example little difference between the results given by 
the large sample approximation to the distribution of T and the Type III 
approximation to the distribution of E<p'(x, T 0 )jn. This is partly due to the 
nature ofp(x \ 0), owing to whose symmetry the maximum likelihood estimate, T, 
has a distribution which is exactly symmetrical with true mean at 0. The main 
approximation in using the large sample method therefore lies in the value 
adopted for the variance. Often the distribution of T will not be symmetrical 
nor have true mean exactly at 0. It is in such cases that a method which uses 
exact values of the lower order moments of T^)jn may be expected to 
yield the greatest improvement in accuracy. 








ON NEYMAN’S “SMOOTH” TEST FOR GOODNESS OF FIT 


I. DISTRIBUTION OF THE CRITERION f WHEN ' 

THE HYPOTHESIS TESTED IS TRUE 

By F. N. DAVID 

1. Introductory 

The y 2 test has passed into common use since its introduction by Karl Pearson in 
1900. It has, however, long been recognized that in certain cases it is inadequate 
as a test for goodness of fit, and in particular in the case when the deviations of 
the observations from the hypothesis tested are consecutively positive (or nega¬ 
tive) ; for the y 2 test, by taking into account the square of the difference between 
the observed and expected values, renders it impossible to pay attention to the 
sign of this difference. It is because of this inadequacy that Neyman (1937) 
introduced the “smooth” test for goodness of fit. This “smooth” test has been 
discussed by E. S. Pearson (1938) in some detail, which absolves us from further 
discussion here, but the procedure of the test will be set out in order that the 
substance of this present paper may be understood. 

Following the procedureof previous papers (1933,1936) Neyman (1937, p. 156) 
insists that a choice of a suitable test for goodness of fit may only be made after a 
statement of the probability functions specified both by the hypothesis tested and 
by the admissible hypotheses alternative to that tested. In order to supplement 
the y 2 test he confines the alternative frequency laws to a class of functions which 
he terms “smooth”, and whose form it is supposed can be represented by trans¬ 
formed Legendre polynomials, The keystone of his test lies in a probability 
integral transformation previously given by R. A. Fisher (1932) and K. Pearson 
(1933), by means of which the distribution of n independent random variables 
may be made rectangular.* 

The criterion f' 1 , which may be used instead of y 2 if the admissible probability 
laws are smooth, is designed to test the departure of the distribution of the trans¬ 
formed variables from rectangularity. Before it may be applied, however, it is 
necessary to choose what may be termed the order of the test, bearing in mind 
which departures from the hypothesis tested it is most important for the investi¬ 
gator not to overlook. The question of the appropriate order of test to choose has 

* For example if 

p(y) - constant ioi a<y<b\ 

=0 otherwise j 

we should say that y was distributed reotangularly in the interval [a; 6]. An alternative method of 
expression would be to say that in the interval [a; b] all values of y are equally likely, 
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been discussed to some extent by Neyman (1937) and E. S. Pearson (1938), 
Further investigation into the matter has been carried out and the results will bo 
published in a second part of this paper. Once the order of the test has been 
decided, then the calculation of the criterion is a comparatively simple matter 
and may be set out in the following way. 

Assume that there are n independent random variables Xj (j- 1, 2, 
following a known continuous probability law which is specified by // Q , the 
hypothesis to be tested. By means of the relation 

rcc 

2/ = p{xjH a )dx, .(1) 

J GO 

the n independent variables x are transformed into n independent variables y 
which are rectangularly distributed, whatever the probability law of the x's. If it 
is decided that a test of the &th order is to be applied, the next step involves the 
calculation of k quantities u { (i = 1, 2, k) by means of the substitution of the n 
variables y which were obtained from (1) into the appropriate Legendre poly¬ 
nomials, 7 t { , where „ 

Ui = rr* £ Ttifyf) i * 1,2, ...,7c. .(2) 

The first four Legendre polynomials* are 

ir 2 (2/)“V 5 (%-i) 2 ~i) ff 4 (jr) = 210(y-i)*-48(y-i)»+t| . 

The criterion iff 2 consists of the sum of the squares of these u' s, thus 

=^s(s^)) a - .W 

Any such criterion is, however, of little use if its distribution is not known, An 
approximation to the distribution was given by Neyman and it is this approxi¬ 
mation which is reviewed in the next section. 

2. DISTRIBUTION OF f 2 

It was shown by Neyman that when n, the number of observations in the 
sample, is large, then each of the quantities u\ (i= 1 , 2 ,..., h), is distributed 
independently as y 2 with one degree of freedom. It will follow therefore that for 
large n the criterion fr 2 of a kih order test will be distributed as y 2 with k degrees 
of freedom. As with many statistical tests based on the assumption that n is 
large, the decision as to the numerical value of n is left to the user of the test and, 
with the introduction of the personal element, we get a variation in ideas both as to 

* It is of interest to note that a recurrence formula for these polynomials follows direotly from 
a general formula for Legendre polynomials (see for example Whittaker & Watson, Modem Analysis, 
p. 302, n). If z~y~\ then 

V(Sr3) *n + l(*)~2z V(2«+i) ^nW+^ri) 
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what is meant by a large sample and the point at which a large sample becomes a 
small sample. It is therefore useful to obtain some idea of the degree of approxima¬ 
tion involved in the supposition that ijr 2 is distributed as y 2 . Although this may 
only be investigated completely by the discovery of the true distribution of i i/r 2 
it is not difficult to find Pearson curves which have moments agreeing with the true 
moments of and these may be used to throw light on the approximation to the 
y 2 distribution. 

The derivation of the true moments of ijf 2 is a matter of simple but tedious 
algebra. Using the notation#’ to denote mathematical expectation, it is seen that 
we must calculate 

ft, = #(^ 2 -#(f 2 ))' for £=1,2,3,4, .(6) 

where \jf' 1 is as defined in (4) and the y' s are independently distributed within the 
interval [0; 1] in accordance with the rectangular law, I have calculated only the 
moments of i// 2 for first and second order tests, partly because numerical work* 
leads me to believe that these order tests are more powerful to detect any depar¬ 
tures from the basic hypothesis H a than any other higher order test, and partly 
because the necessary algebra became so very heavy. 

Prom the properties of the transformed Legendre polynomials we know (see 
Neyman, 1937) that 

* J q v t (y f ) dyf = 0 for any i and j. (6) 

Also since the observations are independent of one another 

.( 7 ) 

and =J o tf!(%)<% “ L ( 8 ) 

In general it will be noticed that we require to calculate 

( n \ 2! 

s TTiiyM for i = 1, 2 and t = 1, 2, 3, 4 .(9) 

and 

« n \2p / n \2a\ 

j | j for i, m = 1, 2and2, 3. 

.( 10 ) 

The calculations required are straightforward and follow at once from the applica¬ 
tion of the elementary theorems concerning the sum and product of expectations. 
The final results are given below in Table I. 

It will be noticed that for n small the moments of and f\ differ considerably 
from those of y 2 . It is, however, only fair to point out here that Neyman supposes, 
as in the case of the y 2 test, that a “smooth ” test for goodness of fit would not be 

* To be discussed in Part II of this paper, 


Biornetrika xxxi 


13 









194 


Neuman's “ Smooth ” Test 


TABLE I 


True moments 
of 

1 Ai='Ui> {k= 1) 

(k=2) 

Pi 

fa 

1 

2 1 


2 

35 n 

Mi 

8-3 + S 

5w 7 n- 


704 722208 

+ 35035w 2 

Mi 

6 0~— + 75 — ~ 
W 5n 35w* 

432 

5n 3 

,,, , 15216 2203468 17946980 

144 + 49 n 35035w 2 119119n 3 

i 


carried out except on a large number of observations. It seems to be possible by 
an approximate method to obtain some idea of how large n should be. 

A consideration of the /^-coefficients calculated from the moments of Table I 
suggests that the most appropriate curves to use would be a Pearson Type I for f\ 
and Type VI for f\. Since, however, it was hoped that both curves would approxi¬ 
mate quickly to the y 2 Type III distribution, as a first step Type III curves with 
the start of the curve at the origin were fitted to the moments of both \fr\ and f\. 
As there are only two constants to evaluate for the Type III curve with zero 
start, the procedure was equivalent to finding a y 2 distribution with the mean and 
standard deviation equal to the corresponding true values for f 2 . For purposes of 
comparison, the 5 % and 1 % levels of y 2 with/ = 1 and/ = 2 were taken, and the 
tail areas of the fitted Type III curve corresponding to these abscissae were 
calculated from Tables of the Incomplete T-Function (K. Pearson, 1922). The 
results are summarized in Table II. 

TABLE II 


/=!, ^i=6-635 



5 

1-76 

0-0523 

0-0098 

10 

1-88 

0-0511 

0-0099 


■ 

B 

B 


/=2, xS.m=6’991, x5.oi=9‘210 

n 

Vi 

P{fl> 5-991} 
P{fl> 9-210} 

5 

3-8171 

0-0511 

0-0100 




nn 

100 

3-9909 

0-0500 

0-0100 
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The probability levels of fr 2 as given by the empirical Type III curve approxi¬ 
mate quite quickly to those of x z , and at first sight it would appear that for any 
size of sample little risk would be involved in testing the significance of by 
means of the y 2 tables. However, it must not be forgotten that the empirical curve 
is itself an approximation to the unknown true ?// 2 distribution. To obtain some 
idea therefore of the approximation involved it is necessary to calculate the 
/?! and of the fitted Type III curves and compare them with the y?’s of the 
true distribution of and of the y 2 distribution. These values of the /?’s are 
plotted in the diagram and their numerical values are given in Table IV. It will 
be noted that the yd’s of the fitted Type III curves lie nearer the y 2 points (8, 15) 
and (4, 9) than do those of the true fr 2 . Hence, while the results of Table II suggest 
that the approximation of the fs 2 distribution by a Type III distribution is a, 
fairly good one, it would appear that further investigation is advisable before any 
definite conclusion may be drawn. 

It was therefore decided to utilize the true third moments of f'K Type I 
curves with zero start were fitted using the first three moments of and similarly 
Type VI curves using those of fr\. It will be recognized that the procedure for the 
fitting of a Type VI curve and the evaluation of its integral is very similar to that 
for the fitting of Type I, and the calculations required were almost the same for 
both curves. Accordingly it is necessary to set out the steps followed for the case 
of Type I only. 

If the equation of the Type I curve is written as 


Z 


1 r(in^ + m 2 2 ) 

a m i +1 T'(m 1 + 1) f(m 2 +1) 


7 % 1 


7V»* 

a J 


.( 11 ) 


easy algebra will give 


M' x =a 


K+i) 

\m 1 + m 2 +■ 2) ’ 


M' 2 = aM[ 


9 %+ 2 ) 

(mi + mj+3)’ 


Mg = «m; 


(ffli + 3) 
(m 1 +m 2 + 4)’ 
.( 12 ) 


where M' v M 2 and M' 3 are the true moments of f\ about the origin. It was not 
found possible to write down the solutions of a, m l and m 2 in any neat form, and 
the procedure followed was to substitute numerical values for M' v M' 2 and Mg at 
an early stage. The partial integral of (11) is recognized as an Incomplete Beta- 
Function. Hence to calculate the tail areas corresponding to y§. 05 and y§. 01 it will 
only be necessary to refer to Tables of the Incomplete Beta-Function (K. Pearson, 
1934). Table III gives the results of interpolation into these tables. 

It will be noted that for strict accuracy triple interpolation would be necessary. 
This, however, called for much calculation, so an approximation was made by 
taking = - 0-50 and using a double interpolation formula only. The effect of 
the approximation is to make the interpolated values slightly greater* numerically 
than those values which would have been obtained by the use of triple inter- 
* A rough form of quadrature for the ease/=l, n ~5 gives f 3 {>/ r ?>3 , 841}=0-0481. 


13-2 
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TABLE III 


/=!, xL,-.=3’841, 6-636 

/= 2, mU=5-991 

xLi=9'210 

n 

5 

PH 


n 

5 

10 

20 

i*t 

6-3944 


7-2971 

* 

18-0489 

17-2312 

16-6668 

a 

mmrnim, 

46-3643 

96-2366+ 

a 

16-2813 

30-8654 

60-3289 

Htty 

WSmmm 


EB7I 


0-29947 

0-154503 

0-07825- 

m, 

8-63986+ 

■grag 

46-07631 


10-57863 

17-81708 

33-52469 

P{f}> 3-841} 


|KSiMB§§ 

0-050- 

P{fl> 5-991} 

0-047+ 

0-049+ 

0-050- 

P{ir{> 6-636} 

■9 

HI 

o-oio- 

P{f|>9-210} 

o-oio- 

o-oio- 

o-oio- 


TABLE IV 



Type of curve . 



g 


fl 

True values 

A 

5-34 

u 

1 




A 

9-84 

■ 

■ 

11 


Pitted Type III 

A 



in 

iB 



A 

.. - 


BH 

Hi 


Pitted Type I 

H 

5-34 

6-61 

7-52 

7-86 



H 

9-94 

12-30 

13-89 

14-71 

f. 

True values 

i 

5-86 

4-97 

m 

4-10 



ft 

14-06 

11-43 

KH 

9-24 


Pitted Type III 

H 

4-30 

4-14 


4-01 



H 

9-45 

9-21 

9-12 

9-01 


Fitted Type VI 

A 

5-86 


4-49 

4-10 



A 

13-74 

11-37 

10-17 

9-23 


polation formulae. As n increases % gradually approaches the value - O’5. For 
values of n greater than 20 the required Beta-Function ratio fell outside the range 
of the existing tables. These values therefore could not be evaluated except by 
quadrature but the results for n = o, 10 and 20 give sufficient indication for our 
purposes of the approximate error involved. A system of triple interpolation was 
carried out for the evaluation of the integral of the Type VI curve hut the calcula¬ 
tions became so very heavy that it was only found practicable to obtain the result 
correct to three decimal places. The results of interpolation were confirmed 
approximately by a rough quadrature of the curves. The tail areas of the Type I 







































198 Neyman's “ Smooth ” Test 

and Type VI curves beyond y 2 .^, 8 ,f o seen to be slightly less than 0- 05 but nearer to 
that value.than the corresponding areas from the Type III approximation shown 
in Table II. On the other hand the Type I and Type VI areas beyond xUi are a 
little further from 0-01 than for the Type III approximation. Except perhaps in 
the case of n = 5 of the Type I curve these differences can hardly be regarded as of 
importance. To judge how closely the Type I and Type VI curves are likely to 
approximate to the unknown true distributions we may compare their /? 2 values 
with the true values. This has been done in Table IV and in the diagram. The 
proximity in position between the solid and dotted circles in the latter, suggests 
that the Type I and Type VI curves must represent the true and \jr\ distributions 
very closely. 

Thus the probability levels of the Type I and Type VI curves given in Table III 
may be taken as lying not very far from the true values, and this shows that no 
great error will be made if we assume for samples of 20 and over that Neyman’s 
f 2 criterion of the first and second orders are distributed as y 2 with one and two 
degrees of freedom respectively. 

3. Sphere oe usbeulness of the “smooth” test 

It was pointed out in § 1 that where the deviations of the observations from the 
hypothesis tested are consecutively positive (or negative) then the )// 2 “smooth” 
test for goodness of fit may be applied. It does not seem possible to give any 
definite rule as to when it might be used in preference to the y 2 test. Neyman 
remarks that it would be interesting to enquire into the relative workings of the , 
y 2 and f 2 tests, possibly with this point in mind, but nothing has been written 
on the subject and it is difficult to see how a comparison of the powers of the two 
tests could be obtained. 

While recognizing the ingenuity of Neyman’s 1 ‘ smooth ’ ’ test, and the attempt 
it makes usefully to supplement the y 2 test, the present writer feels that as it 
stands it is not wholly satisfactory, and that much work remains to be done before 
it can pass into common use. It has been pointed out in § 1 that Neyman (1937) 
and E. S. Pearson (1938) have both discussed the appropriate order of test to 
choose, without however reaching any definite conclusion. It has been possible 
to remedy this in part by confirming E. S. Pearson’s suggestions through numerical 
calculations, but an extension of the theory appears necessary before the test is 
generally applicable. 

In § 2 it has been shown that serious error will not be made if the use of the 
smooth ’ test is restricted to samples of twenty and over, and Neyman himself 
says that he would use the test only if the available sample is large. At present, 
however, the application of the test presents rather a formidable task for any 
computer. For example, if the sample is of size 100, and it is desired to apply a 
test of the second order, the numerical work would entail 100 entries into tables 
of the appropriate probability integral with possible interpolations and 100 
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substitutions into each of the first and second polynomials. Hence for a sample 
greater in size than 100 the labour of computation involved will incline the 
computer towards the simpler and more familiar y 2 test. The need for such a 
“smooth” test cannot be denied, but it would have greater utility if it could be 
applied after some grouping of the observations has been made. Whether this is 
a possible development or nqt I do not know. 

At present it is only possible to apply the “smooth” test when the parameters 
of the hypothesis tested are known, whereas the y 2 test permits one to calculate 
them from the data, as for example in the fitting of Pearson curves. What the 
effect of calculating parameters from the data would be on the criterion I 
cannot at present suggest, although it is hoped to throw light on this point by 
means of a sampling experiment. Certainly if parameters are calculated from the 
data and used in the basic hypothesis, H 0 , it would mean that the variables y 
would be no longer independent or rectangularly distributed. Whether it is 
possible to determine what form the probability law of y would take, given 
restrictions on the original sample, remains to be investigated, 


I have to thank Prof. E. S. Pearson for helpful criticism and Miss J. Townend 
for drawing the figure. 
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TESTS OF HYPOTHESES CONCERNING 
LOCATION AND SCALE PARAMETERS 


By E. J. 6. PITMAN 
University of Tasmania 

1, The comparison op location parameters 
Suppose that k observed numbers 

^li #2> •••> 

are values of k chance variables, and that the elementary probability function of 
the simultaneous distribution of the chance variables is 

the function F being of known form but the values of the location parameters 
a h a 2 ,.,.,a k being unknown, Suppose, further, that we, wish to test the hypothesis 

tllat <*1 = 09= ... =a ki 

which we shall call the hypothesis & 0 . Any test must be based upon some statistic 
J which is a function of the observed values x v x 2 ,x k , and H a will be accepted 
for certain values of J and rejected for all other values. If the observations are our 
only source of knowledge of tbe values of the location parameters, a satisfactory 
test must give the same answer when the observed values are 

«i+A,a: a +A,...,a: A +A, 
as when they are x v %,.,,, x k . 

Hence the statistic J must have the property 

J(xj+A, x k +A)~J{x v .,,, x k ). . (1) 

Without loss of generality, we may assume that J is always positive, and that 
a small value of J is regarded as significant, i.e. as indicating that the hypothesis 
is untrue. If this is not so, we simply replace J by some suitable function of it. 
In order to use the observed value of J as a test of H 0> we must know the distribu¬ 
tion of J when H 0 is true. Since J has the property (1), its value will be unaffected 
by the same change of origin of all the x. Hence, when the a are all equal, we may, 
without loss of generality, assume that their common value is 0. We require then 
the distribution of J when 

— $2 = ... a a k “ 0. 

Let p be any given number (such as 0-95) between 0 and 1. When this distribution 
of J is known, we can determine 6 such that, when H 0 is true, the probability that 
JkQisp. Having chosen p, we reject the hypothesis # 0 if the observed value of 
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J is less than 6, and accept it otherwise. The probability of rejecting H 0 when it is 
true will be 1 —p. If J is so chosen that for a fixed value of 6 the probability that 
J A 6 is greatest when H 0 is true, the hypothesis H 0 will be more likely to be ac¬ 
cepted when it is true than when any alternative hypothesis is true. A test which 
has this property is said to be “unbiased” (Neyman & Pearson, 1936, p. 8). Let 
us attempt to determine J so that the resulting test is unbiased. 

A set of values of the x may be specified by a point (the sample point) whose 
rectangular co-ordinates in a k dimensional space are (x v x 2 , ..., x k ). For the co¬ 
ordinates of a variable point in this space we shall use (£ u £ 2 , ..., £ fc ). Since J has 
the property (1), it will be Constant along any line parallel to the line 

.( 2 ) 

and therefore the region A , consisting of all points for which J^d, will be bounded 
by a cylindrical hypersurface with its generators parallel to the line (2). When 
H 0 is true, the probability that J > 6 is 


P 




k> 


and when H 0 is not true, the probability is 


-f *(5,.{»)«, 

where B is a region formed from A by translation without rotation. Thus, the 
necessary and sufficient condition for an unbiased test is that the integral of 
F(^ v ...,£*.) over the region A be greater than its integral over any equal and 
parallel region B. 

If L is any line parallel to the line (2), we shall write 
P(L)=jj(£ v ...,£ ]c )d V , 
where q = 

the distance of the point (£ 1# £ a ,..., from the plane = 0, and the integral is 
taken along L. Now if A is defined as the locus of all lines L for which P(L) is 
greater than or equal to some constant h, P{L) will be less than h on any line L 
in B which is not also in A . From this it easily follows that 

f *&,...,&)«!...«*>f i- 

JA J B 

and so the resulting test is unbiased. Hence we shall have an unbiased test if we 
define J at any point as equal to P(L) for the line L through that point. 




202 Location and Scale Parameters 

The co-ordinates of a variable point on the line L through the point 

(%! * 2 ! •■’>%) 

may be expressed in the form 

l r ~x r -t (r = 1,2,,,,, k) 


Ex 

where ^ ~ X 

The expression for P{L) becomes 


1 



Ffa-t, ...,x k ~t)dt. 


Instead of P{L) we shall take P{L)j^k for J, and, replacing the symbol l by the 
symbol a , we have 


f* CO 

J = F(x x -a, ...,x k -a)da. 

J -CO 


The test based on this J will be unbiased. 

As a simple illustration, take the case where the x are independent normal 
variables, each with unit standard deviation, and the mean value of x r is a r . 


F(x x ~a l> ...,x k -a k ) = exp {- \E(x r - a r ) 2 }, 

1 f" 

therefore J = ^ exp {- %E{x - «) 2 } da 

g-iS 

“*»( 27 )^’ 

where S = E{x r -x) i > x~Ex r lk. 

In practice we would take S, which is a monotonic function of J, as our criterion, 
large values of S being significant. When H 0 is true, S is distributed like y 2 with 
fc - 1 degrees of freedom. 

Suppose now that the hypothesis to be tested asserts not merely that the 
location parameters of the chance variables x are all equal, but that these location 
parameters all have the same particular given value. By a change of origin of all 
the x this particular value may be taken as 0, Thus the hypothesis to be tested is 
that „ A 

which we shall call the hypothesis H' a . It is obvious that if we take 

J' = F(x v ...,x k ) 

and regard small values of J 1 as significant, the test will be unbiased. Moreover, it 
is very easy to show that, if the x are independent, so that F is the product of the 
separate elementary probability functions, and if each elementary probability 
function is unimodal, then, when H’ a is not true, the probability that J'^d 
increases as any a moves towards the value 0. 
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2. The comparison of scale parameters 
Here the elementary probability function of the chance variables x is 

IJ{llc r }F{x 1 lc 1 ,..„x k lc k }, 

where the c are all positive and we wish to test the hypothesis, 


c i - c 2 - C k> 

which we shall call the hypothesis H v Let us assume for the moment that the x 
are all positive chance variables. Putting 

y r = log x r> a r = logc r , 

we have for the elementary probability function of the chance variables y 

exp (2y r - Ea r ). Fjexp (y x - a x ),..... exp (y k - a k )}. 

The hypothesis H x applied to the parameters c is equivalent to H n applied to the 
parameters a. We shall therefore have an unbiased test if we take as our criterion 


J = 



exp (2y f - ka ). F{exp (y x -a ),..., exp (y k - a)} da 


= n K} f 


c fc+i 


dc. 


The expression for J in the general case where the x are not necessarily positive 
chance variables is 


J ~ IJ{\ x r \) 


1 J{* 1 /c,.,.,a;Jc} 




dc. 


Small values of this are significant. The axial planes divide the k dimensional 
space into 2 k regions, and the probability of the sample point falling in any one of 
these regions is independent of the particular values of the parameters c. By 
considering these possibilities separately and using the result for the case of all 
positive chance variables, it is easy to see that the test is still unbiased. J is a 
homogeneous function of degree 0 in the m; hence, when H 1 is true, its distribution 
is independent of the particular common value of the scale parameters c. Therefore 
in determining this distribution we may'take the common value to be 1. 

From the corresponding result of §1, it follows that an unbiased test of the 
hypothesis H[, that 

c i “ C 2 = ~ C fc ” 

will be obtained by using 

j' ^n{\x r \}.F{x v ...,x h ], 


small values of which are significant. Again, from the corresponding result of § 1 
it follows that if the x are independent positive chance variables with elementary 
probability functions c~ l f r (x r jc r ) such that xf r (x) is a unimodal function of x, then, 
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when H[ is not true, the probability that J'^.0 increases as any c moves towards 
the value 1. 

Tests of other hypotheses will be discussed in a later paper. The remainder of 
this paper will be devoted to a discussion of the most important application of the 
results of this section. 

The problem of deciding whether k samples, each known to have been drawn 
from a normal population, have been drawn from populations with the same 
standard deviation has been discussed by Neyman & Pearson (1931), who have 
proposed a criterion X Hl for this purpose. Applying the method of this paper, we 
obtain a test which is the same as that proposed by Bartlett (1937, p. 273). 
Bartlett’s test is therefore unbiased. It can be shown that the Neyman-Pearson 
test is unbiased only when the number in each sample is the same, in which case 
it is exactly equivalent to Bartlett’s test. 

A continuous chance variable which takes values from 0 to co with elementary 
probability function 1 

_ o-x v m-i 

r(m) e 

will be called a J’(m) variable. If its elementary probability function has the more 
general form 1 


Ml—1 


__ er x H ~ I 

c.r(m) \c/ ’ 

we shall call it an unsealed V[m) variable with scale parameter e. If 

Vv c/21 •••Sin 

are n numbers with mean y, we shall call the expression 

S = i(y r -W 

1 

their squariance.j We know that if the y are a sample of n values from a normal 
population of standard deviation cr, $/n 2 is distributed like y 2 with n— 1 degrees 
of freedom, or, what is more convenient here, |$/cr 2 has a P{|(w -1)} distribution, 
so that S is an unsealed I\^(n -1)} variable with scale parameter 2cr 2 . Moreover, 
in this case, if the mean of the population is unknown, the whole of the information 
about cr supplied by the sample is contained in the value of S. If the mean of the 
population has the known value a , 


X (2/r - «) 2 
1 

is a sufficient statistic for the estimation of cr; it is an unsealed jP(|») variable with 
scale parameter 2<x 2 . Hence questions about variances of normal populations can 

f If this word is objected to, I hope that someone will coin a better, for it is very convenient to 
have a name for this expression. The usual “sum of squares” is not precise in meaning and is 
awkward in use. As I have pointed out previously (Pitman, 1937, p. 215), it is the squarianoe of a 
sample of n that appears naturally in the mathematical theory, and not the variance S/n, or the 
“estimated variance” Sj{n— 1). The last, however, becomes prominent in the applications. 
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be reduced to questions about scale parameters of unsealed F variables, In 
particular, if we have k samples with squariances 

$j, $ 2 , 

and if each sample has been drawn from a normal population of unknown mean 
and variance, the question “Are the variances of the normal populations all 
equal? ” is equivalent to “Are the scale parameters of the unsealed F variables 

$ 1 j $lc 

all equal? ” The problem of answering this latter question we now proceed to 
consider. 

3. Application to gamma variables 
Suppose that x 1 ,x i ,...,x k 

are k observed numbers, that x r is a value of an unsealed F(m r ) variable with scale 
parameter c r , and that the k chance variables are independent. We wish to test the 
hypothesis H v that 

c y — c 2 — c )t . 

Put M = Em. 


Except where explicitly stated otherwise, all summations E and all products FI 
are to be taken over the k values of the operand, which will be written without 
an index. Thus k 

Em = £ m,,, II {af 1 } = n {x 

r=1 r =1 

The elementary probability function of the distribution of the x is 


hfence 




/ = 


n{cF(m)} 


n{x m } 

n{F{m)} 


i; 


g -{Xx)lo 


do 


F{M) ri{x m } 
FI{F(m)}{Ex) M ' 


It is sometimes more convenient to deal with 


K n{x»} 

(Ex) M> 

which differs from J only by a constant factor. The maximum value of K is 

FI{m m } 


and we shall denote the logarithm of the ratio of this maximum to the value of K 
by L , so that 

L - M log (ExIM) - E{m log (a;/m)}. ! 

It is necessarily positive or zero, and large values are significant. 
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For testing the hypothesis H' v that 

c i = C 2 = ••• = c fc ~ 1> 

. , , , II{x m } e~ Sx 

the appropriate criterion is J — ~jy| Tjm)y ' 

We shall write K’ = er**n{x% 

L' = log (X jJ - log K' = Zx-M-£{m log 

4. The distribution of K when H l is true 

As explained above, when B x is true we may take the common value of the c 
to be 1. The probability that K S k is 


(Tr¬ 


over the region where 
Make the change of variables 

Zx - u, 

x r = uy r , (r» \,2,...,k-\) 

it- i 

and write for convenience y k ~ 1 - £ y r> 

i 

so that Zy = 1 and x k - uy k . 

Then integrating with respect to u, which is not involved in the boundary condi¬ 
tion, we obtain for the value of p 

r(M) 


I7{P(m)}, 


Pif 1 ' 1 } % • • • diJk-i over Il{y m ) g /c. 


As the maximum value of K is /7{(m/-/¥) M }, we are concerned only with values of 
k less than this. It should also be noted that in the applications to normal variables 
only integral and half-integral values of m v m 2 ,... occur. 

When k = 0, p = 1, and the region of integration is 
0§y r §l, (r = 1 j 2,/c — 1) 

ft-1 

OS Sws 1. 


Therefore J LI{y m ~ 1 }dy 1 ... dy k _ x 

over this region is equal to 


m m)} 

P{M) ’ 

which may be denoted by JS(m 11 m 2 , It is the extension of the complete 

beta function, 


rim^rims) 


1 
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When 1c — 2, the expression for p is 

1 f*s 

-57-- v x m ir 1 (l-x) m *~ 1 dx l 

where = xfi(l-x 2 ) m * - k. 

In the tabulations in § 5 for h = 2, the value of x x is given. 

When k = 3, 

f * W^H) J 3 ' 5 "'' X ~ y '~ S!r " 1 dyidV ‘ 

over yi i y™ii-y 1 -yz) m *'ii<- 

By the substitution y 1 = uv 

y 2 = u{\-v) (0£t>£l), 

this becomes 

1 f 

p =-^1 - x u m i +m z- 1 (l — u) m *- 1 v nl i~ r (l — v) m *~ 1 dudv .(3) 

Byirifi m 2 , m a/ J 

over u m i +m >( 1 - u) m » v m i(l - v) m ‘ ^ k. 

If m x - m 2 = m 3 = m, and m is integral or half-integral, this is expressible in 
terms of complete elliptic integrals; but even when m = 1 the reduction to 
standard form is tedious and I have not carried it out. However, Nair (1939) has 
shown how to obtain the exact distribution in the general case 


m 1 = m 2 = ...= m k . 

It is of some interest to determine the exact distribution in other cases in 
order to check the accuracy of the approximate distribution discussed in § 5. 
A case which happens to be fairly easy to handle is 

m 1 — m 2 7 = m, m 3 = 2 m. 

From (3) 

P n,• ——x—i \u* m ~Hl-u) lm ~ 1 v m ~' l (\-v) rn ~ 1 du<lv 

over u 2 ( 1 - uf v( 1 - v) '£ K llm . 

Putting %- |(1+ *) (-lg*gl), 

» = i(! +y) ■ 

we obtain 


over 
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which is fairly easily expressible in terms of complete elliptic integrals when m 
is small and integral or half-integral. When m= 1, so that 

m 1 - m 2 = 1, m 3 = 2, 

p = |f(l- x 2 ) dxdy over (1 - x 2 ) 2 (1 - %f) ^ 64/c 


^{(l-x 2 ) 2 -64 k}(Ix over (1-x 2 ) 2 g; 64/c 


where 

This reduces to 


^/{(a 2 -r- x 2 ) ( b 2 - x 2 )} dx, 
i 

a 2 — 1 ~S^Jk, b 2 = 1 + 8 ^k. 


Cu. Ih2_ r 2 fa 

J, 

= 6j*" - ( 2 sin 2 ^) d<l> - 8 b 




V{(« 2 -x 2 ) (6 2 —x 2 )} 
d(j> 


lo -t 2 sin 2 (j>)\ 

where t ~afb. In the table in § 5 the value of f? ~ arc sin t is given in order that the 
results may more readily be checked from tables of the complete elliptic integrals. 


5. The semi-invariants on L and the approximate distribution 
OP L WHEN Hy IS TRUE 

We shall denote the sth semi-invariant of a chance variable Z by A S {Z). If x 
is a Am) variable, A,(log x) = G s {m), 

where G s [m) is the sth derivative with respect to m of 

G{m) = log T(m). 

From this we can easily show that the semi-invariants of 
L = M log ( ExjM) - 2{m log (x/m)} 
are A 1 (Ir) = df{V? 1 (i¥) - log M) - Z{m[Gdm) - log m]}, 

« > 1, A S {L) = (~y {Zm*G s {m) - M°G S (M)}, 

Using the asymptotic expansion of G s (m) and neglecting terms of order 1/m 2 , we 


obtain 


where 

ti 

* 

11 

1 ^ 


1 + a 

4 



Hence L* is approximately a P{^(h -1)} variable. 2 L* is distributed approximately 
like y 2 with 1 -1 degrees of freedom, which is Bartlett's result (1937, p. 274), with 
a different notation. 
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The corresponding result used for testing H[ is that, when H[ is true, 

2L* = T~~ 

1 + a 

is distributed approximately like y 2 with k degrees of freedom, where 

»- 

In the tables below, P is the true probability of obtaining, when H x is true, a 
value of 2L* as great as or greater than that shown. P' is the approximate prob¬ 
ability calculated from the approximate distribution of 21*. Although in praotice 
we are concerned only with the upper tail of the distribution of 2L*, it is of some 
interest to see how the approximate distribution as a whole compares with the 
true distribution, f It will be noted that the approximation improves as a 
decreases. The meanings of the symbols x l and d are explained in § 4. 

TABLE I 

Comparing true and approximate probability integrals 


Wj=%=i, <*=£ 






)'00401 

3-01608 

3-15090 

D-46029 

1- 07735 

2- 65717 

3- 72464 

6- 26726 

7- 37225 



m 1 = 1, m 2 =2, a= 


%=1, %= «>, *=i 



f See Bishop & Nair (1939), where the results of a very thorough investigation of the upper 
tail of the distribution are given. The computations of the present paper were completed before 
the publication of Bishop & Nair’s paper, 

Biometrika xxxi I 4 
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TABLE I (cont.) 




> a “Af 


w 1 =w 2 =l, t»3 = 2, a=j 

V 

X 1 

2 L* 

p 

?' 

0 

21* 

P 

F 

0-49 

0-003809 

0-95081 

0-96079 

7° 

0-10005 

0-9517 

0-9512 

048 

0-015252 

0-90177 

0-90171 

10“ 

0-20321 

0-9043 

0-9034 

043 

0-188518 

0-66432 

0-66415 

19° 

0-71677 

0-7004 

0-6988 

0-39 

0-472482 

0-49205 

0-49185 

27° 

1-40868 

0-4952 

0-4944 

0-31 

1-485259 

0-22306 

0-22295 

36° 

2-42738 

0-2958 

0-2971 

0-25 

2-739828 

0-09785 

0-09788 

50“ 

4-53262 

0-1001 

0-1037 

0-21 

3-905482 

0-04806 

0-04813 

57° 

5-88759 

0-0493 

0-0627 

0-14 

6-958488 

0-00828 

0-00834 

69° 

9-02451 

0-0093 

0-0110 

041 

8*928707 

0-00277 

0-00281 

76“ 

11-32588 

0-0027 

0-0035 


6. Applications 

Suppose that we have k samples, each drawn from a normal population of 
unknown mean and variance, and that we wish to test the hypothesis that the 
variances of the normal populations are all the same. Denote the number of 
members of the rth sample by n r +1 , and the squariance of this sample by S r \ 
then S r is an unsealed F(\n r ) variable, and the hypothesis to be tested is equivalent 
to the hypothesis applied to the variables S r . Thus the appropriate criterion 
ist 

L = pHog [£SlkN)~£{%nlog (Sfyn)}, 
where N = En. 


A large value of L is significant. 

2L = A log (ESIN) - E{n log (Sjn)}, 


2 L* = 


2 L 
1 +a 5 


where 


1 f r l 1\ E(ljn)-llN 
jNf- 3(k~ 1) 


When H 1 is true, 2 L* is distributed approximately like with k-1 degrees of 
freedom.;]; Trom the tabulations in §5 it is evident that even with very small 
samples the approximation is sufficiently good for us to use it to determine 


t The difference between this expression, L, and the function L x as defined by Key man & 
Pearson should be noted. The relation is referred to on p. 212 below as well as being discussed by 
Bishop k Nair (1939). 

f Cf. Bartlett (1937, p. 274). 
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whether an observed value of L is significant or not. It may be noted that if we 

v r = S r jn r , 


put 


the “estimated variance” derived from sample r, the expression for 2 L is 

2 L = N\ogv~E{nlogv), 
where v = EnvjN. 

Hence L has its minimum value 0 when 


Vl = v 2 =... = % 

and dLjdv r has the same sign as v r - v, and therefore L increases as the v diverge 
more and more from a common value. Thus the L test may be regarded as answer¬ 
ing the question “Are the estimated variances % v k significantly different ? ” 

If we wish to test the hypothesis that the variance of each population is 
unity, we obtain the appropriate criterion from the fact that this hypothesis is 
equivalent to the hypothesis H' applied to the variables hS r , The criterion is 
therefore L < = S x S j og (^n)j, 

and we have 2L' = 28 - N - E{n log (Sjn)}, 


2L* = 


2 L' 
1+a’ 


a = 


E(l/n ) 
3 k ' 


When H' is true, 2 L* is distributed approximately like y 2 with k degrees of freedom. 

It should be noted that the field of application of the L test is quite different 
from that of Fisher’s z test, which is used in the analysis of variance. When, in the 
analysis of variance, we test for significance of treatment differences or significance 
of an interaction, the question we put is not ‘ ‘ Are these several estimated variances 
significantly different?” but “Is this particular estimate, or are any of this 
particular group, significantly greater than that particular estimated variance 
(residual variance) ? ” The truth is that, when engaged in the analysis of variance, 
we are interested not in variances of normal populations but in means of popula¬ 
tions. A straightforward treatment of an analysis of variance problem by the 
method of likelihood leads to a criterion function which, when the null hypothesis 
is true (treatments without real differences of effect, or no interaction, as the case 
may be), is certainly the ratio of two estimated variances; but the question raised 
is not “Is this ratio significantly different from I ? ” but “Is this ratio significantly 
greater than 1 ? ”f We are led to Fisher’s z test or an equivalent, and not to the L 
test. It is true that in the analysis of variance we have certain squariances S v 
$ 2 , ... which, when the null hypothesis is true, are unsealed gamma variables 
with equal scale parameters, and so the corresponding L will have the distribution 
discussed above; but this would not justify the use of the value of L as a test for 
the null hypothesis. In devising a satisfactory test we must consider the state of 

f That being so, it may be remarked in passing, to speak of a “negative mtoraction” in a way 
which implies that it has some significance is a breach of statistical good manners, it is questioning 
the referee’s decision after having agreed to accept it as final. 
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affairs not only when the hypothesis to be tested is true but also when It is false. 
In this case, when the null hypothesis is false, some at least of S v »S' a , ... are not 
unsealed gamma variables but have a different distribution, and so the proof that 
the test is unbiased no longer holds. When used outside the field in which it 
naturally arises the L test loses its theoretical satisfactoriness. If, for example, 
the totals for the various treatments were identical, the L test applied to variance 
due to treatments and residual variance would judge this significant, i.e. as 
indicating that the treatments produce real differences in effect, while the z test 
would rightly judge the result as non-significant. Improbable as such a result is, 
it is more probable when the null hypothesis is true than when it is false. It would 
seem that the L test must lose power by judging as significant results in which 
the variance due to treatments is small compared with the residual variance, and 
that the z test must be more powerful in this field than the L test, i.e. more likely 
to reject the null hypothesis when it is false. Wishart (1938) proposes to use the L 
test as a supplementary test in experiments of factorial design when the z test 
has given a non-significant result for treatments as a whole, but it seems unlikely 
that this combination of the tests will be as powerful as the z test used alone. 


7. The Neyman-Peaeson test 


The Neyman-Pearson criterion function for testing the hypothesis that the k 
normal samples have been drawn from populations with the same variance is 
(in our notation) 

_ I7{[5/(»+!)]«»+«} 

Hl “ {LS/iN+tyW+V )T 

small values of this being significant. Remembering that S r is an unsealed 
P(\n r ) variable, we see that this is equivalent to using the criterion function 


K , IJ{x m -M} 
(£x) M+ik 


instead of 


jr 

~ (Lx) M 


for testing the hypothesis H 1 applied to the variables x v x 2 , .... x k of §2. 

Denoting by p' the probability that K' > k', we can easily show that when 


dp' 

3cj 


over the region 


c i - c 2 


c, £ - c, 


cii{r(m)} m 6 n ' [Zm dZl ■ ■ ‘ dZk 


IJ{z m +l} 

(2z)M+w 


>k' 


t Neyman & Pearson have denoted by L v 


(4) 
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Prom consideration of the derivative with respect to z, of the function on 
the left-hand side of (4) it is obvious that a line parallel to the z x axis, which 
cuts the boundary of the conical region (4), cuts it in two points and all points 
between these lie within the region (4). Let £, £' (£<£') be the z x co-ordinates 
of these points—they will be functions of z 2 , z 3 , ..., z k . Consider 

J z x er lz J7{z m ~ 1 } dz,. 

Integrating by parts, we obtain 


- e~ Ss z, 


/7{z m-1 }J + m x J e~ Ss TI {z 111-1 } dz x 


77{ 2 5} 

since when z x - £ or the equality in (4) is satisfied. Hence 


J> 


m x ) e~ s ~ //{z” 1-1 } dz x = 




m 

, ftV*= Zl (iS)"+« 

■ * Jt m 

Thus, when c x ~ c 2 = ... = c k = c, we have 




£ 

1- 


df+p , 1 


+ 57 K- 


.Tz 2zj 


*' f e-£%(£zr+w i M + jk 1), , 

0c' ci7{P(OT)}J 77{z«} j 1 £z + 2z i r 1 " ,s 

over the region (4). By the method of § 4 we obtain 


0p' kT(M) f 1 ~ky x , , 
d Cl ~2cII{r(m)}j R n{yi } dyi ••• ^ 

over the region R determined by 

where Sty = 1 as before. A necessary condition for the 1C test to be unbiased is 

ir Q> (r==l,2, " ,,fe) 

when Cjl = c 2 =...= Cj., 

and therefore for an unbiased test we must have 


l Um iVi • • % - 1 ir ~ '■ 2 ' •- t) . (5) 

It is obvious from symmetry and from the fact that Sy = 1 that (6) is true when 

m x = m 2 -... = % = m. 
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In fact the K' test is then equivalent to the K test since 

X’ = x {m+i)lm , 

and the K' test is therefore in that ease unbiased. If the m are not all equal, 
suppose m 1 > ra 2 , then oWj+S | jmi+1 a )» 2 +l 

according as a = b. 

Hence if P is any point in R with its y i co-ordinate less than its y 2 co-ordinate, the 
point P 1 obtained by interchanging the y x and y 2 co-ordinates of P will also lie 
in R, but if P is on the boundary of R (or sufficiently near it and inside R), and has 
its y x co-ordinate greater than its y 2 co-ordinate, the point P' will lie outside R. 

% 

Therefore dy x . . . dy k _ x > 0, 

J it n \y s 

and so (5) cannot be true. Thus the 1 C test is biased when the m are not all equal. 
Considering the application to samples from normal populations, we see that the 
Neyman-Pearson test is biased unless the number in each sample is the same, in 
which case it is equivalent to the L test. 

8. The distribution of K when H x is false 
The probability that K is greater than or equal to k is 


v- 

77{c m /’(m)}]' e ^ ) dx \-- dx k 

over the region where 

*m >K 

(Ex)*-' 

which is equal to 

77(/ 1 (m)}J C n ^ z > dz i~' (lz k 

over 

17{c m z m } ^ 

[Ecz) M 

By the method of § 4 we obtain 


V /7{P(m)}J 77 ^ Lki-Alk-x 

over 

All 

Srt 


where Ey-l. When 1c = 2, this can be evaluated by means of the tables of the 
incomplete beta function. It can be evaluated also in some simple cases oft = 3. 
An approximation to the distribution of L in the general case is being investigated. 
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Summary 

Tests of certain hypotheses concerning location and scale parameters are 
developed. These tests are unbiased and are applicable to chance variables with 
any continuous distributions. An application to the comparison of variances of 
normal variables yields Bartlett’s test, which is thus shown to be unbiased. The 
Neyman-Pearson variances test is shown to be biased except when the samples 
are all of the same size. 


REFERENCES 

Bartlett, M. S. (1937). Pm. Roy. Soc. A, 160, 268-82. 

Bishop, D. J. & Nair, U. S. (1939). J. R. Statist. Soc. Suppl. 6, 89-99. 
Nair, IT. S. (1939). Biomelrika, 30, 274-94. 

Neyman, J. & Pearson, E. S. (1931). Bull. int. Acad. Cracovie, A, 460-81. 

-(1936). Statist. Res. Mem. 1,1-37, 

Pitman, E. J. G. (1937). Pm. Camb. Phil. Soc. 33, 212-22. 

Wishart, J'. (1938). J. Agric. Sci. 28, 299-306. 


NOTE 

[As one of the authors of the original Lj test and of the conception of “ bias ” in connexion 
with tests of statistical hypotheses, it is perhaps permissible for me to say that Prof. Neyman 
and I had for some time realized that the test was slightly biased when applied to samples 
of unequal size. The difference between this test and that put forward by M. S. Bartlett, 
and discussed more fully by Prof. Pitman above, may be expressed by-saying that the 
Bartlett test weights tho sums of squares with degrees of freedom where the test had 
used sample sizes as weights. Intentionally the former weighting seems the more appro¬ 
priate to take but, as in certain problems of estimation with which the reader will probably 
be familiar, the application of tho principle of maximum likelihood, a form of which Neyman 
and I had used, introduces sample sizes rather than degrees of freedom. In 1936 B, L. Welch 
{Statistical Research Memoirs, 1, 53) had suggested the use of a test function in the form of 
the ratio of a weighted geometric mean to a weighted arithmetic moan of sums of squares, 
the weights being adjustable at will; but it was Bartlett who definitely advocated the 
weighting with degrees of freedom. 

In a Ph.D. Thesis for the University of London, completed in 1937, only a part of which 
has been published (U. S. Nair, 1939), Dr Nair showed that in a particular case with k = 3 
and sample sizes of 5, 10 and 15 the L l test was certainly biased. During tho past winter 
he also sent me a mathematical proof showing that in the simplest case, with k =2, tho 
Bartlett test was unbiased, Of two or three people investigating the problem, Prof. Pitman 
has however been the first to reach a complete solution. His proof is of great generality; 
not only does it cover the case of any number of samples or groups of data, but it is derived 
from a very interesting general approach not limited to the particular problem of testing 
for homogeneity of standard deviations among normally distributed variables. E. S. P,] 
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(i) On the Calculation of the Cumulants of the distribution 

By N. L. JOHNSON and B. L. WELCH 


In a recent investigation it has been found necessary to calculate the cumulants of the 
^•distribution with considerable accuracy. This may be done, of course, by calculating the 
moments of y about zero and then correcting in tho usual way. It was found, however, 
that a substantial saving of labour may be effected by another method which will bo indicated 
below. This method is essentially the same as that employed by K. Pearson (1916) to calculate 
the constants of the standard deviation distribution, being extended to give the fifth and 
sixth cumulants. 

The distribution of x "with/ degrees of freedom is given by 


p(x) d x ~ 2i^r(iJ) e ~ lx * x '~' d% ' 


The kt h moment of y about zero is 


K = 2** 


nm*)) 

m ’ 


whence it is seen that, the even moments are integers and the odd moments integral multiples 
of n\. The jfcth moment fi k about the moan may therefore bo expressed as a polynomial of 
the fcth degree in n{ with integral coefficients. Hence we obtain the following formulae 
for the first six cumulants: 


= liv 

*2 

= (~ 2/+ l)/Ci+2xJ, 

*i = (- 2 / 2 + 2 /) + ( 8 f~i)K\-K 

. = (16/a-16/+ 3) Kl+ (20 - 40/) k\ +24< 

- (16/ 3 - 24If+8 f) + (- 136f+13(5/- 28) k\ + (240/-120) k\ - 120/c°. 

By substituting (f~K t ) for k\ we may now obtain formulae for /c s> /c 4 , at s and K t involving 
only the first power of /c x and powers of /c 3 up to the third. Thus 


^3 — ^l(l"" 2 /Cjj)j 

K i = - 2/(1 - 2/c 2 ) + 4 K t ~ 0/cl, 

k 5 - *i[4/(l - 2/c,) + 3 - 20 k 3 +24/cl], 

= - 8f(l-2/c s )~/(20-]04Kj+120/cl)+(28/c,-I20icl+120«?). 

The facts that the multiplier of the highest power of/ is always a multiple of (1 - 2/r 2 ) and that 
K » tends to ] as/tends to infinity, suggest that some simplification may result by expressing 
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these cumulants in terms of powers of (1 — 2 K t ) instead of powers of Putting (1 — 2x a ) = a, 
we find 

Kj, = K x a, 

K i = l)a-fa 2 , 

^5 = Kq[ — 1 -b (4/—2) os+ 6a s ] 

= Ki[ —2/c 4 + 3a a ], 

*■« = (2/— 1) —(8/ 2 —8/— 1) a -15(2/—1) a 2 - 15a 3 
= 2(2/—1) K 4 + 3a —12(2/— 1) a 2 — 15a 3 . 

The calculation of the higher cumulants by means of these formulae is rendered very easy 
once Kj is known. For/small k 1 may.be calculated from the following formulae: 


, h (/—!)(/—3).,. 3-1 

/even k.= ----- 

v 2 (/— 2) (/— 4)... 2 


/odd , 1 = 

1 V 7T(/-2)(/-4)...3-l 
= 0-797884360802865, 

J | = 1*25331 4137315500.) 


(To 15 decimal places 


For/ large Stirling’s expansion for the factorial may be used, giving 

l ° gKl = ilOS/ "^ + 2^“2^ + lI^ + -’ 

.('ll 5 21 399 869 ) 

* l ~ ^l 1 4/ + 32/ 2 + 128/ 3 “2048/ 4 “8192/ 6 + 65536/ 6 + '‘''J * 

The above formulae allow the calculation of the cumulants very quickly with as muoh 
accuracy as is ever likely to be required, but the following short tables, giving only a few 
figures, will be useful for many practical purposes. Note that in the second table the 
cumulants are multiplied by powers of / in order to increase the accuracy of harmonic inter¬ 
polation. 


0-797885 

0-363380 

0-218014 

1-253314 

0-429204 

0-177460 

1-595769 

0-453521 

0-148340 

1-879971 

0-466708 

0-128935 

2-127692 

0-472926 

0-115210 

2-349964 

0-477669 

0-104953 

2-553231 

0-481014 

0-096954 

2-741625 

0-483494 

0-090506 


0-114771 

-0-004438 

-0-152659 

0-045149 

-0-037791 

-0-068652 

0-022247 

-0-029635 

-0-029175 

0-012860 

-0-021825 

-0-014166 

0-008271 

-0-016482 

-0-007712 

0-005730 

-0-012867 

-0-004696 

0-004189 

-0-010346 

-0-002935 

0-003190 

-0-008526 

-0-001978 
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/ 

24// 


K 2 


*J 3 

K iWf? 

*J 3 

6 

4 

1 

0-477669 

0-257082 

0-20627 

-0-18910 

-0-9927 

8 

3 

0-969311 

0-483494 

0-255989 

0-20413 

-0-19291 

-1-0128 

12 

2 

0-979406 

0-489176 

0-254427 

0-20025 

-0-19390 

-1-0114 

24 

1 

0-089640 

0-494686 

0-252418 

0-19464 

-0-19211 

-0-9859 

CO 

0 

1-000000 

0-500000 

0-250000 

0-18750 

-0-18750 

-0-9375 


REFERENCE 

Pearson, Karl (1916). Appendix to Papers by “Student” and R. A. Fisher. Editorial. 
Biomelrika, 10, 522-9. 


(ii) Note on Discriminant Functions 

By B. L. WELCH, PhD. 


Recent years have soon ft widespread development of the methods of multivariate analysis. 
Of the problems which have been discussed one of the simplest is that of apportioning 
individuals with several measured characters into one or other of two completely specified 
population groups. Thus if xq, ,r a , ..., x„ are the q characters measured upon eaoh individual, 
and n j and 11 2 denote the two possible populations, wo suppose known the probability 
distributions p t = p{x ll x i , | I1 L ) and p J = p(x ll a , 2> | I7 a ). In particular it is fre¬ 
quently assumed that 77 a and Z7 a have the same set of variances and covariances and differ 
only in the mean values of tile characters. In this case R. A. Fisher (1936) has considered 
the problem of choosing the best linear function X of®,, m 2 , to form the basis of 
classification of the individuals. The solution was obtained by maximizing the absolute 
value of the ratio of E(X 1 11 j) — E(X | I7 2 ) to the standard deviation of X. This is certainly 
the best discriminant function of any kind, whether linear or not, provided jq and p a are of the 
multivariate normal form. , 

However, without making any assumptions of normality or equality of variances and 
covariances, the problem of obtaining the best function to discriminate between two 
completely specified populations may still be solved. The function is simply the ratio of the 
two probability laws p x ip v Tills is almost self-evident, but the following demonstrations 
may bo useful 

Suppose in the first instance that it is possible to assess a priori probabilities w 1 and w t 
that an individual will belong to Il 1 or JJ t respectively. This was possible, for instance, 
m the problem of soxing human mandibles, discussed by E. S. Martin (193(1). Then, if measure- 
ments x x ,x^ ...,x e arc made on the individual, the a posteriori probabilities that it will 
belong to Il x or 77 a will bo <<kPi/(w l p 1 + M a p 2 ) and o) i pJ(o> l p l + w 2 p a ), respectively. Now 
if it is equally important that an individual really belonging to II 1 should not be classified 
as belonging to 77 a and vice versa, then we should assign it to 77 x provided the a posteriori 
probability o> 1 p l l{m 1 p 1 + w 2 p a ) is greater than |. Any other assignation would increase 
the overall chance of misclassifications. We should therefore classify into 1J 1 when 


Pi “h 
Pi Wj' 


( 1 ) 


Whatever the prior probabilities the discriminant function to be calculated from the 
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observed measurements is therefore the ratio of the probability laws; the criterion level to 
which this function is to be referred does, however, depend on the prior probabilities. 

If it is not possible or even appropriate to assess prior probabilities then the determination 
of a rule of classification must be made to depend on other conditions. For instance, it 
may be required (a) that if an individual really belongs to II l there should be the same chance 
of it being miselassified as there would be if it belonged to IJ 2 and (6) that this oonrmon 
chance should be a minimum. If x lt x s , be represented by a point in ^-dimensional 
space, any rule of classification involves the choice of a region R in this space such that an 
individual falling into R will be classified into 77 1 . The above conditions (a) and (6) may then 
be written 

[ p a dT= 1- [ Pi dr, .(2) 

Jr . Jr 

and I pi dr to be a minimum. .(3) 

JR 

The problem is therefore to minimize / p 2 dr subject to the condition that / (p t +Pi)dr 

Jr Jr 

is equal to unity. A straightforward application of the calculus of variations shows that JR 
consists of points such that 

.(4) 

where k is chosen to satisfy (2). The discriminant function is the same as before although the 
criterion level will in general be different. 

Whatever the conditions which are imposed, provided they are formulated on a prob¬ 
ability basis, it would appear that wo shall be led to the same discriminant function. To take 
a further instance we may mention that in discussing the theory of testing statistical 
hypothesis, J. Neyman & E. S, Pearson (1932) have considered the problem of testing the 
hypothesis that a sample comes from a certain population when there is only one poBBible 
alternative population. They imposed the conditions (a) that the chance of rejection when 
the hypothesis is true should be a small specified probability e, and (6) that thB ohance of 
rejection when the alternative hypothesis is true should be a maximum. The solution 
depended only on the ratio of the probability laws of the sample on the two hypotheses. 
This ratio has been termed by Neyman & Pearson a “likelihood ratio’’ and sometimes 
a “ test criterion ”, It seems to me that “ discriminant function ” is a better term for this ratio 
and that “criterion” is best reserved for the particular value of the function which is con¬ 
sidered critical. This value will, as we have seen, depend on the manner in which the problem 
of deciding between the two populations is formulated. 

When the populations II 1 and I7j are normal multivariate, differing only in their mean 
values, it is easy to see that the ratio pjp^ leads to the same linear function of x lt x lt ...,x, 
as is obtained by maximizing {E(X \ 17 x ) - E(X | Z7j)}7°i> For if 

P(xi, » a , ■■.,x,\n t ) = (q^~) exp [ ■- j £ W li {x t - d u ) (x, - 0,,)] (t = 1,2), 

.( 6 ) 

where W is the determinant of the covariances and TP” are the elements of the reciprocal 
matrix, then 

log ^ = - 1 x TF w {(aj( - e n ) (x , - e n ) - {x, - 6 ti ) (x s - e it )} 

pi i,j 

i,J 7J 

X W^idjx - 6 ] a ) x, is the same linear funotion as is obtained from the other approach (R. A . 
i.i 

Fisher, 1930). 
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(iii) Principles of Genetics. By E. W. Sinnott and L. 0. Dunn. Fourth Edition. 
London: McGraw-Hill Publishing Company, 1939. Price 21s. 

This book, which has gone through three editions since 1925, needs no recommendation 
to geneticists. It is so good on the whole that a short review can bo mainly concerned with 
its defects. Mendel did not use red-flowered peas as stated on p. 41, but purple-flowered 
( rot-violett) and, if he had crossed rod and white, would probably havo got a purple F v The 
treatment of biometrical methods on pp. 138-51 is unsatisfactory. For example, the 
account of the significance of a difference of means takes no account of "Student’s’’ work, 
and is therefore just 31 years out-of-date. There is no need to point out the serious nature 
of the errors which may arise from this neglect. And since Bateson’s original definition of 
genetics included a study of populations, I cannot but feel that tho authors havo devoted 
too little .space to this topic. Nevertheless, as an elementary account of modern genetics, 
the book can be strongly recommended. 

J. B. s. H. 


(iv) Corrections to formulae in papers on the moments of y". By J. B. S. 
Haldane, 

The following errata should be noted: 

Biometrika, 29, 138, equations (3): 

For n%~ = 8 + 2( 1 lk - 6) 4- (ft* - 30/e -f 120) s~ a , 

read /t 3 =/c 3 = 8-4-2(11^-- 56) « -1 -h(A: 2 — 30^-4-120) 

Biometrika, 29, 390, antepenultimate line: 

For /c, = 3840 4- 21,300m- 1 + 249,600wr a 4- 69,160m- 3 + 2004m- 4 + m~\ 
read K t = 3840 4- 124,800m- 1 4- 249,600m- 2 + 69,160w- 3 4- 2004m~‘ 4- m~\ 

Biometrika, 29, 391, equations (1): 

Fori k s - 3840a + 21,30022,.4- 249,6007? 2 -j- 69,160i? 8 4- 20047/, +.R i , 

read K, = 3840n4-124,800R 1 4-249,600R 2 +69,160E 3 +2004N 4 4-.R 5 . 

And line 20: 

For 4-5814-04n 2 4-18,640-492ft, 
read 4-5814-04ft 2 4-39,340-492ft. 


J. B. S. H. 
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ON GENERALIZED ANALYSIS OF VARIANCE. (I) 

By P. L. HSU 

1. The Wilks-Lawley hypothesis. Suppose that, as a result of some random 
sampling, we are in possession of p(n 1 +n) quantities, where n'^p, calculated 
from the observational data. Calling these y ir and z ir - (i — 1,2, ...,p: r— 1,2, ...pnp 
r 1 = 1,2,we assume that in repeated sampling they follow the probability 
distribution 

( 1 P «1 1 P n \ 

Const. Xexp -- X oc u X {y ir -ifi ir ) {y jT -Vir )X a« X z ir z jr \IIdydz, (1) 

[ t'=U r=l J 

and that we have no previous knowledge of the values of the vj ir Our problem is to 
test the hypothesis H 0 : 

y ir = 0 for i = l,2,...,p; r= 1,2, {H 0 ) 

For p = 1 the hypothesis is that to which every linear hypothesis* may be 
reduced, and the test amounts to that ordinarily employed in the analysis of 
variance. For % = 1 the problem calls for Hotelling’s generalized “Student’s” 
test.f 

It should be emphasized that the y ir , z ir and y ir will not usually correspond to 
the original physical observations and physical constants belonging to the sampled 
populations, but are so derived from them that (1) is true and that H 0 is equivalent 
to some hypothesis regarding population constants that we want to test. In another 
paper we shall deal with a general class of “linear hypothesis” on multivariate 
normal means, showing how all of them can be reduced by a rotation of the sample 
space to the “ canonical ”H 0 . Here we content ourselves with the following example. 

Example. Case of k samples. x ivl (i= 1,2, ...,p; v=l,2,...,jfc; t=l,2, 
are k samples drawn respectively from fcp- variate normal populations all of which 
have the same set of variances and covariances. Let the population means be 
£i v (i = l,2,....pi v=l, 2,Let 

k ^ k 

X If = jf S W|i lV , 

1 _ 1 *L _ 

= — X m X 

i m „ =i 

(i,j = l,2,...,p; v=l,2,...,k). 

* Kolodziezczyk (1935), Tang (1938). 
f Hotelling (1931), Hsu (1938), Bose ,and Roy (1938) 
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Suppose that we want to test the hypothesis: 

la = £»=••• = £<* for i=1 > 2 ,•••>?>• 

Then it is known that if we call y ir and z ir certain linear functions of the x ivt and y ir 
certain linear functions of the £*,„ (1) will hold true and the hypothesis under test 
is equivalent to H 0 . Thus reducing the problem, we have 

n 1 = k — 1, n — M-lc, 

m A: n k 

X VirVjr 1=1 X *fy) fajv X "ir^jr = X i 

7*ssi r=l p—1 

tti _ 

X \lirVjt ~ X ^v(£i'y £t) (£j> 

r -1 p =1 




In particular, if, Ic = 2 then n t = 1, and we may drop the second index of y 
and y. We have then 

y iVj = (% - %) (*ji - %). 

U Wlj 7U| 

2%r z lr = 2 (%i“%)(^U-^l)+ S (^2/~® <a )(^2(-^a)» 

real £ = 1 £= 1 


(f,i=lj2, 

The hypothesis in the above example has been studied by Wilks (1932), while 
an attempt to test the general hypothesis H 0 was made by Lawley (1938), Both 
assumed no prior knowledge of the values of the Oy. We propose to call H 0 the 
Wilks-Lawley hypothesis. 


2. Case where the a tj are known, generalizations of Mahalanobis’s distance . 
From now on we shall write 

n 1 n 

®1 j ~ S Virlljn z ir z Jr> 

r=* 1 

tii = S hirVjr, S a tj ^, 

r-1 4,1=1 

(i,j = l,2,...,p). 

In order to test Jf 0 when the ol^ are known, we calculate the likelihood ratio* 
and get p 

-2 log (likelihood ratio) = £ = S, say 

4 , 3=1 

and decide to reject H 0 if the value of S exceeds some fixed constant, chosen so as 
to fix at some desired level the risk of rejecting H 0 when it is- true. 

* For the definition and interpretation of the terin see Neyman & Pearson (1928). 
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Theorem 1, For arbitrary values of the 7j ir the distribution of 8 is that of the sum 
of pn, non-central squares :* 

I oo iffkm ] 

2 - iP n l8 ivn x -l f ,-M + V) S - -, \dS. (2)' 

U=o i h h[r(h + ipn 1 )j w 

In particular , if II Q is true, then F = 0 and $ follows the yf distribution with pn x 
degrees of freedom. 

Proof. Let the sets of variables (y lr , y 2r ,..., y pr ), for r = 1, 2, ..., n h be subject to 
the same linear transformation such that, calhng the new variables (%, u Zr> ..., u pr ), 

p p 

S a ijVirVir = liK, ( r= l)2, 

1 , 1=1 1=1 

This* is possible because the matrix ||a^|| is positive definite. Let p ir be the same 
linear function of the rf s as u ir is of the «/’s. Then 


5 = S S 4 (3) 

1= 1 r= 1 

and the u ir follow the distribution 

(2tt)-^«i exp | - ~ £ S K-^) 2 ! Hdu. (4) 

l ^i=l r =i ) 

From (3) and (4) follows the result (2),f as we have 

P n, p n, 

2 2 dir = S “y S 7l ir 7j ir = IF. 

1=1 r=l 1,1=1 r-1 ' 

From (2) we get <o(S) = W+pn v (5) 

£{S 2 ) = W 2 +2{pn l +2)'F+pn l {pn l + 2), 

whence <r 2 (S) = 4F+2pn r 

If these general results are applied to the above example of k samples, we have 

i) fc 

S = 2 = 0*. 

i,i=i v=i 

1,1=1 X=1 


say, and the distribution 

w xnh ah 

* is" 4 A Mr(^+|p(fc-i)) 

For A « 2 we have 

= S %(%-%)(%-%). 

"*1+ "*2 1, 3 = 1 

m l T "»2 1,3 = 1 


* Fisher (1928), p. 669. 


f Fisher (1928), Tang (1938), pp. 138-9. 
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and the distribution 

( to xuh ,C 111 \ 

)dS 2 . 


XUh ah 

2-iPSV- 1 e^ s ^\ S 2 2 


i*h\r(h+te>) 


The quantity x l\ is equal to 


'P m i m 2 A 
m 1 + m 2 


( 7 ) 


where A is Mahalanobis’s “distance”* between two populations having the same 
variances and covariances weighted according to rn 1 and m 2 . The statistic 


m L + m 2 
pwqmr. 



1 

mf 


which, according to (5), is an unbiassed estimate of Zl, is called the D 2 statistic and 
the distribution (7) has already been obtained.! These concepts and formulae are 
completely generalized here by 8b W k and the formulae (5) and (6) for the case 
k> 2 . 


3. Tests suggested by Wilks and Lawley : Generalized E 2 , 1 — E 2 and E\ 1 - E 2 )~ l . 
From now on we shall assume complete ignorance of the values of the <x,y. In the 
case p = 1 the following functions are well known: 


n, I / n, n 

w = 2 Vr I ( 2 fr+ 2 z r 
r— 1 / \r a l r»l 

n I / ni n \ 

s*?/ £s£+ 2*v 2 

r-1 / \r=l r-1 ! 

ni in 

E 2 !(i~E 2 ) = s yf 2 *?■ 

r=l / r=l 


Wilks, while studying the /c-sample case,J suggested the functions 17 and IF 
respectively as generalizations of E 2 and l-E 2 , 'In our general case these are 
defined as 


JJ _ 1 a i l \ _ 

l a y+M 


if n^p, 


( 8 ) 


and 


W — 1 ^i) 1 

K + M 


(9) 


Both are ratios of two determinants. The above definition for U based on the idea 
of a generalized E 2 breaks down if n 1 <p, as then | a {j | vanishes identically, This 
difficulty can be met if we define U as the product of the non-identically vanishing 
roots of the determinantal equation | a {j - 6(a ij + b {j ) | = 0. this definition 

evidently coincides with (8), 


* Mahalanobis (1936). 


t Bose (1936). 


$ Wilks (1932). 
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As a generalization of E' 2 (l - A 2 )" 1 we take, instead of 17 If' 1 , the function V 
suggested by Lawley* and defined as 

f = E Way, 

1 

(10) 

where b ij is the element (i, j ) of the reciprocal matrix || by || -1 . 

If we denote by 6 lf 6„, ,..,0 L , where l x is the smaller of the integers p and n v 
the non-identically vanishing roots of the equation 

K-0(fl%+y 1 =0, 


then, according to (9) and (10) and the new definition of V , we have 


u - n e* 

(11) 

n 

1 

(12) 

ii 

1 ^ 
^ ‘ 

(13) 


W and V are test functions for // 0 suggested respectively by Wilks and Lawley. 
If we use W (or V), we reject Ii 0 if If is smaller (or V is greater) than some pre¬ 
scribed constant. In the following sections we shall study the distributions of 
U, V and If in repeated sampling. 

4. Case when U Q is true. If H 0 is true, then, putting all the ij ir - 0 in (1), we get 
the parent distribution 

const, x exp j - ^ J, Uy(ay + by) j U dydz. (14) 

The following theorem has been proved elsewhere, f 


Theorem 2. Let l x be the smaller, and l 2 the larger, of the integers p and n x . Let 
Q v , B 2 ,..., d[ be the non-identically vanishing roots of the determinantal equation 

|a«-<?(«y + MI-°» ( 16 ) 

arranged in the order of descending magnitude: 1 ^6^6^ ...^6^0. Then the 
simultaneous distribution of the d’s, as derived from (14), is 


n ih 


h 

flnrv 


r\{n~'pLk+i) ) 
-li + ijtyin-p+ijr&j 


n<9, 

i=l 


! h )i(n~p-l)l h l, 

n n n M-) 

i=l ; u=l j=i+1 


n dd { 

1 


(16) 


Therefore the distributions of U , If and V are those of the functions in (11), (12) 
and (13), where the B’s follow the distribution (16). 

f Hsu (1939), Fisher (1939). 


* Lawley (1938). 
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Take the particular case l x = 2. We have then 

u = oa, w = (i-d l )(i-o i ), 

whence ?7 ! + lT l ^l, V =W~ 1 {1~U-W), 

and the distribution (16) becomes 

( ^ i ^)‘"=- 3> {(i-g.Hi(17) 

From (17) we easily obtain the joint distribution of U and W: 


, n f (w ~^ 2+I) T -s W^-v-'UUdW. 
4r(i t -i)r(n-p+i) 

Integrating with respect to IF ranging from 0 to (1 - U 1 ) 2 , we get the distribution 
of U: 

__Jd») 

Similarly, the distribution of W is 

1 


2B(l it n-p+\) 

Again, from (18) we find the distribution of F: 
F{n-p + l 2 + 1) 




ir(i 2 -i)r(n-p+\) 




in 

iJ 4 


4(1+F) 
(2+TO* 


yi(n-p+l) _ y)WrV(ly 


(19) 


dV . (20) 


Now in the particular case considered where l x - 2 we have either (i) p = 2, 
l 2 = w x if jKWj, or (ii) n x = 2, l 2 = p if n x ^p. Hence, from (18), (19) and (20) we 
get the following three pairs of distributions: 

1 £7l(m-3)(i- Wy-HU, 2 


2U(w 1 -l,n) 

1 


2B(p-],n-p + 2) 

1 


Uto-Vll-Wy-MdU, 2=n^p, 


2B(n lt n~l) 

1 

2B{p,n~p + l) 

r(%+n-1) 

4r(n x -i)r(»-i) 

F{n + 1) 


4f(p-l)r(n-p+l) 


W«»- 3 >(l-JF‘)“i-W, 2 =p^n v 
W^-v-VO-W^-'dW, 2 = n 1 < 3? , 

(1 +f yl(n~i)n -yY^i-Vdy, 2 =p^n v 

J 4(l + F)(2H F) - ‘ 

(1 + F) -i(n ~ p+3) dF j* 1 y\Cti-p^){\-y)\(p-^dy, 2 = n^p. 


4(T + ni2+TT 
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The general expression for the moments of U and W can easily he deduced 
from (16). We have, from (16), 

J 1=1 

where C(l 2 ,n) = 7r y i If jnjj —n ~ > 

' 2 i=i ruk - i 1 +i)rMn-p +») rj* 

I k - llfe-t-l) I h I h l. 


mk,n) = \\[d, 
U-i 


jnu-^) 


n n (Oi-Oj) 

i=l j—i-j-l 


Hence 


j(8\l^n) n dd i - 
J i 


whence <f( U*W*) = C(l»*) Jm+fcn + g,) ft dd, = 

= *1 fy(n-p + l % +i)ty{l i -l 1 + 2q 1 + i)rj{n+2q 2 -p + i) 

~ iJiPU n -P + 1 2 + 2 ?i + 2 ?2 + *) - li +i) r\(n -p +i)' 

The case of k samples (see p. 221) of two variables (p = 2) has been studied at 
length by Pearson & Wilks (1933). The hypothesis f/ 0 and the functions U and 
W are called in their paper H 2 , U 2 and L 2 respectively. It is found there that the 
test based on U 2 (i.e. U) cannot be regarded as an adequate test of if. We shall 
show in the next section that H 0 is true if and only if two population constants, 
called A r and A a , both vanish. The disadvantage of D referred to by Pearson and 
Wilks, may be expressed by saying that it is unlikely to be able to detect the 
falsehood of H 0 if only one of A x and A 2 vanishes. 

5. Simplification of the parent distribution function. The matrix ||a y || is 
positive definite; hence it can be expressed as GG',* where C is some non-singular 
real matrix. Write 


Y = 

2/u 2/ia •• 

• Vlni 

, z = 

z u 

Z 12 • 

• Z ln 

, G = 

Vu V 12 

• Vlni 


2/21 2/22 • 

• Viiii 


Z 21 

Z 22 • 

■ z 2n 


Vil 1 ?22 ’ 

• him 


Vpi Vpi ■ 

• Vpni 


Z pl 

Z „2 • 

■ z pn 


Vpl Vp2 • 

• Vpni 




Theorem 3. Let G be of rank l (hence l < p, l < n t andl is the rank of GO' = || ||), 

and lei A x , A a , .... A, be the non-vanishing roots of the determinantal equation 
\f ij -\(r i .\ = 0. Let 6 V d p 6^ be the non-identically vanishing roots of the 
determinantal equation | a#-%#+&#) I = 0. Then the joint distribution of the 6’s, 
as derived from (1), depends upon the parameters A x , A 2 ,..., A, alone in such a way that 
instead of (1) we may regard the parent distribution as 


(2n)-ir< n i+ n) exp (- \T) exp 



* The aocent denotes the transposed matrix. 
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Hence, in studying the d i stribution of any functions of the 0’s, such as U, 7 
and W , we may replace (1) by (21). 

Lemma 1 (Hotelling).* Let A and B be any positive definite matrices of orders 
m and n respectively , and G be any real matrix of order mxn and rank l, There exist 
two non-singular real matrices, Nj and N a , such that 

N t AN| = I.f N 2 BN' = I, N t CN^ = D, 

A 

where D= ° ,f E> VA = A 

°° 1 A 

and the A i are the non-vanishing roots of the determinantal equation 

|CB _1 G'-AA| = 0. 

Allowing both A and B to be unit matrices in Lemma 1 we get the following: 

Corollary. Let C be any real matrix of rank l. There exist two real orthogonal 
matrices I\ and T 2 such that 

I^CTi = D 

A 

where D = ^ ^ , ' D v - A = A 

A 

and the A £ are the non-vanishing latent roots of the matrix CC'. 

Proof of Theorem 3. We write tr A for the sum of the diagonal elements of any 
square matrix A, It is easily verified that tr (AB) = tr (BA) whenever AB is a 
square matrix. 

The expression inside the bracket in (1) may be written as 

- \y- jftr (CC'YY') — *tr (CC'ZZ') +tr (CC'GY') 

~-W-\ tr (C'YY'C) -£ tr (G'ZZ'C) + tr (G'GY'G). (22) 

The A £ , as defined in Theorem 3, are the non-vanishing latent roots of the equation 
| GG' - A(C') -1 C- 1 1 = 0. On pre- and post-multiplying by | C' [ and j C | respec¬ 
tively this becomes |C'GG'C--AI| =0. Hence the A f are the non-vanishing 
latent roots of the matrix (C'G) (G'G)'. Therefore, by the corollary to Lemma 1, 
there exist two real orthogonal matrices, I\ andT 2 , such that 

D Va O 

r^'GTg = D = VA 

2 0 0 

* Hotelling (1936), pp. 326-30. 

f I and O stand for a unit matrix and a zero matrix respectively. 


( 23 ) 
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The transformation of variables 




Y = 

(C')-ir;ur', 

z = (C'^riv, 

(24) 



u n u vl 

• • ’ W X»1 

% . % •• 

• hn 

where 

U = 

u n u 22 

”• U 2ny 

J 

y _ % V 22 ' • 

• v* 



U pl U p2 

... u vni 

v pi V pi • • 

■ Vp U 


are the matrices of the new variables, has a constant Jacobian and leaves the 9 's 
invariant, because the equation (15) becomes 


(15') 

n x n 

where a’ {j = J u ir u jr , b’ tj - JJ v ir v jr , ( i,j = 1,2, 

r= 1 , r= 1 

To show this, we have, remembering the orthogonality of I\ 2 , 

||<%|| = YY' * (C') - 1 rjUUT 1 C'- 1 , 

IM = zz'MC')- 1 r;vvT 1 c-h 

Hence the equation (15) is transformed into 

|UU'-0(UU'+VV')| = 0. 
which is another way of writing (15'). 

On the other hand, substituting (24) into (22) and remembering (23) we get 
the expression 

(UU') ~-|tr (W') + tr (DU') 

1 1 i-1 

Replacing again the letters u and v by y and z respectively, we obtain the result. 
It may be noticed that 

W = tr (CC'GG') = tr (C'GG'C) = £ A,-. (25) 

i=l 

Remark, For the case of k samples (see Example on p. 221) Fisher (1938) has 
considered the problem of testing for the colinearity or coplanarity of the k 
populations. It can be proved that the hypothesis of colinearity (or coplanarity) 
states that all except one (or two) of the A’s vanish. 

6. Behaviour of Y when H 0 is not necessarily true. The Laplace transform of a 
probability density function p(x) vanishing identically for x < 0, viz. the integral 

t*oo 

e~*xp(x) dx, 

Jo 


has the following property. 
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Lemma 2. Let p n {%), (n- 1,2,3, and p(x) be probability density functions 
vanishing identically for x<0, and such that 


*£0 

lim e~ ax p n (x)dx- 
n-vtoj o 


for every a > 0. Then 


lim 


c p{x) dx 


(26) 


p n {u)du = p{u)du, 
o Jo 


Proof. The function/,^) = p[x)-p n {%) summ&ble in (0, oo) and., by (26), 


/•oo 


lim e~ xx f n (x) dx ~ 0 for every a > 0. 


3 J 0 


(27) 


The function 


( Jn( z ) 


-r. 


X fn(z)dx 


is an analytic function of z, regular at every z with a positive real part and uniformly 
bounded. By (27) g n (z) tends to a limit whenever ? is real and positive. Hence by 
Vitali’s theorem of convergence* g n (z) tends to a limit uniformly in the half-plane 
on the right of the imaginary axis. This limit is an analytic function regular at 
every z with a positive real part and vanishes whenever z is real and positive; 
therefore it vanishes identically. Hence 


lim 

n-»a> J 


g-ax+ilx 


•to 

p n {x)dx~ e~ ax + ilx p(x) dx (28) 

'Jo 


for alL a > 0 and real t. Let 

L n (a) = I er ax p n {x) dx, L{cc) = I e~ ax p{x)dx, 
Jo Jo 

lim L n (a) - L(a). 


so that 


It follows from (28) and (29) that 


(29) 


If 01 1 f“ 

lim y e~ au+ilw p n {u)du = -yr- e^ a+Uu p(u)du, 
J o -0(a) Jo 

whence, by a well-known property of the characteristic function, 

1 


whence 

whence 


lim — 

n-*co 

lim 

n—>oo, 


h 


Miu = W)il 


e~ av p(u)du, 


r* 

e~ au p n (u)d.u— e~ au p(u)du , 

Jo 


lim lim 

a-»-t -0 n~?co 


Cx 


e ~ all Pn( u ) du - p(u) du. 


The order of the above repeated limit can be interchanged because 


i: 


e~ au p n (u)du-+ 


p n {u)du 


uniformly in n as a^+0. This completes the proof. 

* Titohmarsh (1932), p. 168. 
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We shall now derive an expression for the Laplace integral <#(exp (- aF)). To 
simplify the notation we shall use a single letter, e.g. y to denote a set of variables, 
e.g. all the y ir , when they figure as arguments of a function. We shall write dy for 
IJdy and a single integration sign for multiple integrals with respect to the variables 
from -oo to oo. We denote the integral <f(exp (- ccV)) by L(a). 

The following equation can be verified by direct integration, on referring to 
(10) as the definition of V: 

exp (- £a 2 V) = jf(y, z,a.,t)dt, 

where 

f(y,z,a,,t) = |] in iexp{— | £ b^ + ia'Z tyiAr}, 

{ ^ i,j—l i=l r—1 i 

ni 

$ij = 2 ^ir^jr (^5 j = ^ • • • j P)> 

r=l 

and where the letter i , when not figuring as an index, stands for <J- 1. Denoting 
by p(y, z) the coefficient of ITdydz in (21), we have 


L{\ « 2 ) = jp(y, z) dydz jf(y, z, a, t) di = dt jp(y, z)f(y, z, a, t) dydz , (30) 

as the change of the order of the integration is obviously legitimate for every real a. 
By direct substitution it results 


p{y,z)f(y,z,x,t)dydz = (27r)-^( 2| n+«)exp(-|li [/ )/ 1 (a,f)/ 2 ((X,f), (31) 


r ( i v 1 i— v "■ 

where A(a, t) = exp - - £ a u + £ VA t y u + ia £ £ t ir y ir 
J l "i=l i=1 i=lr=l 

/«(<*> *) - J| | i% exp 


dy, 


Ip ip 

"o S 

^ i=l 'Hj-l 


The integral (32) is readily evaluated and runs 


A(a, <) = (2zrp'*i exp (£!F) exp | - |a 2 £ £ t%- ia £ VA,;** 

i=l )'=1 i=l 


(32) 

(33) 

(34) 


The integral (33) can be evaluated by using Wilks’s formula for the moments of 
the generalized variance.* The result is 


/,(«, t) = 2iJ , ' l i(27r)t®» J£|tf iy +s y | -«•»+«, 
where S. Cj = 1 if i = j and 0 otherwise, and where 

/A rb(n-i + 1) ’ 


( 35 ) 


* Wilks (1932). 
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From (34), (35), (31) and (30) we obtain 

L{\ct?) = tt-^iK f| %+Sy |~i<«i+»>exp j - |a 2 £ £ Va ( *«| dt , 

J n--i -i=i J 


whence 

L{a) = ir-^K 


S {j + s £j |~K ,l i+»)exp 




i =1 


i«l 


df. (36) 


It may be observed that the right-hand side of (36) involves no sum of n terms; 
hence it is particularly useful when we wish to make n approach infinity. 

We may calculate the moments of V by means of (36), Thus 


c?(F) = n-i^iK 

I 

~ n-p -1 


(W+pnf), 


dt 


(37) 


•irtonK 2 s h ) +2 S 


{(£««)“+ 


+ 


o (2 ^A { 

" \i=i / 


dt 


pn 


4£A,A/ 

i+i 


(?i-p~l)(ri~p~3) (n-p)(n~p-l)(n-p~3) 


+ 2<F 


pw-i + 2 


•a(y-i)K-i) 


-p 


(n-p-1) (n-p-3) (n-p) [n-p- 1) (n-p- 3) 
pn 1 (pn 1 + 2) 2(p-1) (n 1 — 1) 


(n-p -1) (n-p-3) (n-p) (n-p-1) (n-p-3)' 


(38) 


Remark. Consider again the case of k samples (see Example on p. 221) and 
write V k and W k correspondingly. It is seen from (37) that the statistic 

{M-k~p-l)V k -p{k-l) 

is an unbiassed estimate of ¥ k . We may regard V k as a generalization of V 2 , which is 
the “Studentized” D 2 statistic* but for a constant factor. F 2 is, of course, also 
identical with Hotelling’s generalized “Student’s” ratio except for a constant 
factor. 

We shall next study the behaviour of F as n-+oo. As n is now allowed to vary, 
we shall attach to various letters the index n, In particular we write x F n for ¥ 
to emphasize the fact that the value of ¥ depends also on n, a fact which is not 
brought out by the formal definition of IF. This is because the ij ir , as we explained 
on p. 221, are themselves linear functions of the original population constants 
with coefficients depending upon n. 

* Bose and Roy (1938). 
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It is in general true that, except when IJ Q is true, in which case W n s 0, W n is 
either 0(1) or 0(n) as n->co, That both are possible is seen in the Example on 
p. 221 for k = 2; W n is 0(n) or 0(1) according as m 1 = m 2 ->co or one of m lt 
remains fixed while the other tends to infinity. 

Theorem 4. If W n — 0(n), then the mean of V is ~ W n + o|^j and the variance 
of V is o/^ ). Hence the random variable F - - !F n 0 in probability. 


This is an immediate consequence of (37) and (38). 

If W n = 0(1), and if W n tends to a limit as n-t-co, the limiting distribution of 
nV will be that of 8 (cf. (2)) with W replaced by its limit. This is our next theorem. 
We assume now that 

Iimy*=!F 0 . (30) 

co 

The case F n s 0 (i.e. II 0 is true) is covered by (39) on putting = 0. But (39) 
may also be regarded as covering the case W n = 0(n ) in the sense that the repeated 
sampling is not made from the same population, but from populations whose 
distribution constants vary with n in such a way that (39) holds true. Thus, for 
example, in the case of two samples with m 1 - m 2 (see p. 222) we have 

W n = \md> = l[n + 2)0, 

p 

where 0 = £ a. i:l (£ a -£ i2 )(i, n -£ j2 ). 

Here we have x F n - 0{n) if the population constant <t> > 0. W n will be 0(1) if we 
consider 0 as decreasing to the order n~ l as n increases indefinitely. This idea of 
regarding population constants as varying with the sample size can be found in 
the works of Fisher* and Neyman.f 


Theorem 5. If [ I J n lends to a limit F a as n~>co, then 


where. 


lim Pr{nV 

n->co 



co 

p(x) = 2 

h =0 


b h h\r(h + \pnf}' 


(40) 

(41) 


In particular, ifW 0 - 0 (i.e. H 0 is true), the limiting distribution of nV is that of y 2 
with pm,! degrees of freedom. 


Proof. Write Lf\a) for the Laplace transform of the probability density 
function of nV, so that 

Jjf\<x) - cf(exp(-~awF)) = L(m). 


* Fisher (1928), p. 603. 

t Neyman (1937), pp. 169 -70, (1938), pp. 70-1. 
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If we replace a by na in (36) and subsequently make the change of variables 
hr = n-h ir , we get 

Uf\a) - r-^vr^hK t 


k+r? 




where K„ is now written for K and 


exp (- a S 4 ~ i V( 2a ) S n^t a] dr, 

i=l i“l j 

(42) 


n t 


S{j — Ti r Tj r , (i, j — lj 2, 


r=l 


We shall now find the limit of Lf\ot) as n -» oo. We have, using Stirling’s formula, 


lim n~ ipn iK„ — 


For every system of fixed values of the r iT we have 


(43} 


whence 


lim 

7l->0O 


&U + - 

1 n 

4 + 


= i + l -ib+o 

n i= 1 


ft' 


2 > 


n 


!(»,+«) 




(44) 


Calling 7 n the integral in (42), we have 

|-l(«i+n) 


4= ■ 

« 


4+ 


- exp (- ££ 4) [ exp J - a £ 4 - * V( 2a ) .1A r ti ) dr 

+ fexp (- (| + a) f; *«-»V( 2a ) S dr 

J { i<= 1 i= 1 i 




(45) 


The absolute value of the integrand of A n is less than 

2exp|-aS4j. 

Hence by (44) and dominated convergence, A n ->- 0 as n~> oo. The value of B n is 
easily found to be 


B n = (27r) 1]3Wi (1 + 2a) -!jm i exp 
From (42), (43) and (45) we obtain the result 


l + 2a 


ccW n 


(46) 


lim Lf ) {a) = (l + 2a) -l3 ' ,! iexp — ff 

n-j-M \ 1 + 2a 

If p{%) is defined as in (41), then the integral 

r oo 

e~ ax p(x) dx 
Jo 

is identically equal to the right-hand side of (46). Theorem 5 is thus proved on 
remembering Lemma 2. 
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7. Behaviour of W when H Q is not necessarily true and n is large. We shall 
establish the following theorem. 

Theorem 6. If W n = 0(1), the statistics nV and —n log W tend to be certainly 
identical in the sense that 

lim (nV + 7i log W ) = 0 in probability. (47) 

tt-VOO 

Corollary. If W n tends to a limit W Q as n — > oo, — n log W has the same limiting 
distribution (40) as nV as n->oo, 

Remark. Besides the Corollary there is another significance to be attached to 
Theorem 6. Since the test functions V and W are not functionally related (except 
when p = 1 or % = 1), there is the question of choosing one of them to be con¬ 
sistently used in carrying out the actual tests. This can only be decided by a 
comparison of their power* to detect the falsehood of H 0 when the y ir do not all 
vanish. While this may be a difficult problem for small samples, Theorem 6 appears 
to have answered the question for large samples. In fact, if n is large, it is almost 
certain that the values of nV and -ralog W calculated from the sample will differ 
very little. That - wlog W and nV tend to have the same power function is 
a consequence of the Corollary. 

Proof of Theorem 6. Bor every 0 such that 0 ^ 0 ^ 1 we have 

6 It 6 V 

Hence, remembering (12) and (13), 

1 Zi / Q. \z ii h 0. \2 

0<F + log^ 5 s(j^) SCSA) -***■ 

whence, for every r\ > 0, 

Pr{| nV + n log W | > ?/} = Pr{nV + n log IT > y) < Pr || P > y 



because of (38). This establishes (47). 

Proof of the Corollary. Let 

F n (x) = Pr{nV <*}, O n {x) = Pr{-n log W <£}. 

Then for every x and y > 0 we havej 

0« G n (x)-F n {x) ^F n {x + y)~F n (x-y) + Pr{nV + wlog W >y). 

* Neymau & Pearson (1936, 1938). 
f Frechet (1937), p. 164. 
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Let F{x) be the limit of F n (x) as n~> oo, which is known to exist by virtue of 
Theorem 6. We have then 

0 G n (x) - F n [x) $ | F n (x + y)-F(x + Tj) | +1 F n {x -ij)~F{x~ //) | 

+ F{x+rj)- F(x~r)) + Pr{nV+nlog W^tj}. (48) 

Given any e >0, choose and fix an rj > 0 so small that F(x + y)-F(x~y)<e. By 
(47) the rest of the terms in the right-hand side of (48) is smaller than e for all 
sufficiently large n. Hence G n (%)-F n {x)-+ 0, i.e. G n {x)-»F(x), which completes 
the proof. 

Summary 

The Wilks-Lawley hypothesis concerning population means of multivariate 
normal populations is put in the canonical form H 0 (§ 1), and, assuming the popula¬ 
tion variances and covariances known, the test function S is derived together with 
its exact distribution (§ 2). In the case where the population variances and co- 
variances are unknown, two possible test functions, denoted by V and W are 
considered (§§ 3 and 4), and their distributions in certain special cases are given. 
In § 5 it is shown that the sample space can be so transformed that all the variables 
are independently distributed and that only a minimum number of unknown 
parameters remain. These parameters are the roots of a certain determinantal 
equation; the hypothesis H 0) and the hypotheses of colinearity and coplanarity 
of populations, all specify the values zero for all or some of these parameters. 
Returning to the test functions V and W, it is shown that as the sample size n 
increases indefinitely the two functions nV and - n log W tend to be certainly 
identical ((47), § 7), and both of them tend to have the same distribution function 
as 8 ((40), § 6), 
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THE DERIVATION OF THE FIFTH AND SIXTH 
MOMENTS OF THE DISTRIBUTION OF b 2 IN 
SAMPLES FROM A NORMAL POPULATION 


By C. T. HSU and D. N. LAWLEY 
Dejnrtment of Statistics , University College, London 

1. Introduction 

The method of combinatorial analysis was first introduced in 1928 in a paper by 
R. A, Fisher (1929). In this paper Fisher defined new symmetric functions 
k v fe 2 , fc 3 ,... of the observations for samples of a given size, and gave simple rules 
for determining the cumulants or semi-invariants of their joint sampling dis¬ 
tribution. This method is especially valuable for the case where the sampled 
population is normal, and in this case its use reduces considerably the labour 
of deriving the higher sampling moments of the distribution of product 
moment statistics. Two further papers appeared later, one a joint paper of 
R. A. Fisher and J. Wishart (1931) in which further rules were given, the other 
by Wishart (1930) in which he described applications of the theory and gave a list 
of higher order formulae for the normal case. By means of these formulae 
E. S. Pearson (1930) was able to derive the first four moments of the sampling 
distribution of the statistics and b i for the case when the sampled popula¬ 
tion was assumed to be normal. 

The object of the present paper is to derive formulae for *(4 B ) and /c(4 6 ) 
(quantities which are defined below) and then to use these to determine the fifth 
and sixth moment coefficients of the distribution of b v This work is preliminary 
to further investigations regarding the sampling distribution of this expression. 

If %, . .be a sample of size n from a given population, then adopting 

Fisher’s definition of the symmetric functions k v k 2l k z , k it we have 

K = %, 


(»-3) {( ’ 1+1 )",-3("- U"». 

, 1 n 

where m t =*~S{x- xf, x = - 8 *. 

n x' n i 

The coefficients k r are chosen so that E(k r ) - K r , where K r is the rth cumulant of 
the given population, and the pth cumulant of the distribution of k r is as usual 
denoted by K(r*). 
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k(t p ) is expressible as the sum of a number of terms of order pr consisting of 
powers and products of the coefficients k 2 , k 3 , K pr , and to each such term there 
corresponds a number of two-way partitions whose coefficients have to be 
evaluated. As in this paper we are, however, assuming the sampled population 
to be normal, the only non-vanishing cumulant is i<c 2 , which is equal to cr 2 , the 
variance of the distribution. The only term, therefore, which has to be evaluated 
is the one containing a power of k % . 

2. The derivation of k(4 5 ) 

For a full explanation of what follows the reader is referred to the three 
papers by Fisher and Wishart already cited. 

To determine the formula for k( 4 5 ) we must find the coefficients of all the two- 
way patterns for the term in k\°. There are altogether five such patterns, which 
are given in Wishart’s paper, and they are reproduced below. In each pattern it 
will be noted that there are five corners with four arms attached, each comer 
representing a k it and ten connections between the pairs of arms, each 
connection representing a /c 2 . 

/-TVs /-A-\ 

n m 

\i<>i/ \<>/ 

ABC 

Fig. 1. 

For each of these five patterns, we must determine (1) the numerical coefficient, 
(2) the n-coefficient. 

It is not necessary to give in detail all the working required for doing this, but 
we shall give two examples for each process and also a summary of the results. 

(1) The numerical coefficient is obtained by enumerating all the ways in 
which the pattern can be connected up, regarding as a separate entity each corner 
and also each arm attached to that corner. 

(a) Consider first the pattern A. 

p 

Q 


Pig. 2. 

There aie \ x 4! = 12 ways of determining the cyclic order of the five corners. 
There are six ways of choosing a pair of arms belonging to corner P to connect 
up to Q, and similarly for the other four corners. 



AvTA 

v2Sv 

D 
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Finally, there are two ways of joining up the double arms between each pair of 
adjacent corners. The numerical coefficient of A is therefore 12 x 6 6 x 2 5 = 12 x 12 5 . 
(6) Consider the pattern D. 




Fig. 3. 


There are five ways of choosing which corner is to form the centre T. 

There are three ways of choosing the pairs of corners P, Q and R, S which are 
linked up by double arms, and two ways of connecting up these two pairs. 

The arms of corner T may be arranged in 4! = 24 ways, while those of each 
of the other corners may be arranged in 12 ways. 

Finally, there are two ways of connecting up each of the two double arms. 

Hence the numerical coefficient* of D is 

5 x (3 x 2) x (24 x 12<) x 2 2 = 240 x 12 6 . 

We may remark that the five normal patterns for /c(4 6 ) can all be obtained from 
those of x(4 4 ) by the insertion of a fifth corner. This fact allows us to derive the 
numerical coefficients by a different method, which may be used as a check. This 
method must, however, be employed with care as otherwise it is liable to suggest 
a wrong result. We shall illustrate it by considering again the pattern A. 

The pattern A may be formed from the smaller pattern which has a numerical 
coefficient of 62208, by breaking one of the double arms and inserting a new 
four-way corner. 



Fig. 4. 


The break may be made in four places and the arms of the new corner may 
then be joined up in 24 ways. The numerical coefficient of A would thus at first 
sight appear to be 62208 x 4 x 24 = 24 x 12 8 , 

It must, however, be remembered that when for instance the break in the 
smaller pattern is made between P and Q, the broken pattern corresponds to two 
arrangements of the unbroken pattern, since there are two ways of joining up 
the double arms between P and Q. Hence the true value of the numerical coeffi¬ 
cient of A is in fact half the above number, i.e. 12 x 12 s . 

* There is apparently a mistake in Wishart’s paper, the numerical coefficient of D being given 
there as 120 x 12 s . This will of course mean that the approximate value of k { 4 6 ) found in that paper 
is incorrect, 
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(2) The ^'-coefficient. 

In a previous paper (Fisher & Wishart, 1931) rules were given for 
deriving the ^-coefficients of a given pattern from those of non-vanishing 
patterns of a lower order which are already known. Thus we can derive the 
^.-coefficient of A, which is one of the patterns of /c(4 5 ), from those of two 
oi the patterns of /c(4 4 ). This is illustrated in Fig. 5, the Greek letters below 
each pattern are used to denote the ^-coefficient of that pattern. 



a = 2 


(- 1 ) 


,A+ 


n(n+ 1) 


(»-2)(n-3r (w—1)(»—2)(n—3) 


7 . 


and we also know that 




n(n +1) n 1 — 8«. 3 + 21k, 2 — lin + 4 
(?i-l) 3 (w~2) 3 (?i-3)3 


and 


n 3 

7 = (»-l)®(»-2)»’ 


Hence we obtain the result 


_ n(n+l) (■ , 
a (n— l) 4 (tt-2) 4 (w-3) 4 ^ 

This is the ^-coefficient, of A. 
Similarly for pattern B. 


1 l?i 6 + 45?i 4 - 85w 3 + 70w s — 36«. -f 8}, 
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Here we have 


A = 


n +1 




(ft-1) (n- 2) (ft-3) 
n 




and 


Hence 


A = 


« -1 

ft(ft+l)(ft 2 -5ft + 2) 

(ft-1) 4 (ft-2) 3 (ft- l) 3 ' 

ft 2 (ft + l) 2 (n 2 -5n+ 2) 


(ft-l) 4 (ft-2) 3 (ft-3) 3 ‘ 

This is the ft-eoefficient of B. 

The complete results are as follows: 


Numerical 

Pattern coefficient/12 8 
A 12 

B 80 

C 120 

D 240 

E 32 


w-coefficient 

-»i(tt+l)/{(tt-l) 4 { n - 2) 4 (n-3) 4 } 
m°- llft 5 + 45n 4 - 85» 3 + 70n 2 —36w + 8 
9ft s + 23» 4 - 7ft 3 -28ft 2 +12ft 
n a - 12w s + 52 m 4 - 94Ti n + 59ft 2 - 30a + 8 
m" - 13m. 5 + 59w 4 - 103n 3 + 48n 2 - 24 m + 8 
— 15m 5 + 80m 4 - 160m 3 + 75m 2 -41m+ 12 


The formula for /c(4 5 ) is calculated by finding the sum of products of the 
numerical coefficients with the ft-eoeffieients. We thus obtain the result 
4x 12 5 ft(ft + l) 


<( 4 & ) = 


(ft - l) 4 (ft-2) 4 (ft-3) 4 


(12lft B - 1473ft 5 +6335ft 4 

- 10,675ft 3 + 4900ft 2 - 2536ft + 840} 4°. 


3. The derivation of a(4 6 ) 

There are altogether 17 normal patterns for /r(4 6 ), which are shown in Fig. 7. 

All the patterns except the last two (R and S) may be obtained from those of 
k( 4 5 ) by the insertion of a sixth corner, while the patterns R and S may be obtained 
from those of k( 4 4 ) by the insertion of a pair of corners. The numerical coefficients 
may as before be calculated in two ways and we shall content ourselves with a 
summary of the results. We shall, however, give two examples of the derivation 
of the ft-coefficients. That for the pattern A is shown in Fig. 8. Here we have 
(-!) ft(n+l) 


a = 2x 


A- 


(ft-2) (ft-3) 
w(w +1) 


(ft-1) 4 (ft-2) 4 (ft-3)' 


ft + -LAi- y 

1 [n-l){n-2)(n~‘A) r ’ 

{ft 8 -1 lft 5 -+ 45ft 4 - 85ft 3 + 70ft 2 - 


■36ft+ 8}, 


r = 


-§=: 


ft 


ft 3 


( ft -1) ( ft -2) ( ft -1) ( ft -2) ( ft - l ) 3 ( ft -2) 3 


Hence 


(ft- l) 4 (ft-2) 4 ' 

ft(w+l) 


(ft-1) 5 (ft-2) 5 (ft-3) 5 
This is the n-coefficient of A. 


{ft 8 - 14ri 7 + 78ft° - 220ft 5 + 341ft 4 

- 310ft 3 + 212ft 2 -88ft+ 16}. 
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Similarly for the pattern S. 


4 —V, 
s ^ 1 

X 


and 


A = 


/*“ 


v = 


A K 

-t y- 

H- 

Fig. 9. 

(n+ 1) 

(n- 1) (w- 2) (n—3)^’ 


n 

n -1 


", 


n 2 (n+ l) 2 


Hence the ^-coefficient of S is 

A - 


r~i 

t__y 


V 


(n- l) 3 (n-2) 2 (n-3) 2 ’ 
n z {n+ l) 3 


(n- l) 6 (?i-2) 3 (n-3) 3 ‘ 

The complete results are given in the following table. Here c denotes the numerical 
coefficient divided by k, where k = 2 U x 3 4 x 5 = 6,635,520 and a r is the coefficient 
of n r in the expression for ^-coefficient divided by N, where 

n(n+ 1) 


N 


(n-1) 6 (n-2) 5 [n- 3) 5 ’ 


Table of coefficients 


Pattern 

C 

«8 

0, 

<i 9 


«4 


#2 

fli 

a 0 

A 

27 

1 

-14 

78 

-220 

341 

-310 

212 

- 88 

16 

B 

216 

1 

-12 

54 

-100 

33 

96 

- 80 

24 

0 

C 

324 

1 

-15 

88 

-254 

373 

-279 

178 

- 76 

16 

D 

324 

1 

-14 

70 

-140 

53 

138 

-116 

24 

0 

E 

162 

1 

-15 

90 

-266 

389 

-267 

160 

-176 

16 

F 

648 

1 

-16 

99 

-292 

403 

-236 

153 

- 64 

16 

G 

216 

1 

-16 

104 

-324 

449 

-208 

110 

- 62 

16 

H 

432 

1 

-13 

60 

-106 

17 

111 

- 02 

24 

0 

J 

1298 

1 

-16 

100 

-300 

421 

-236 

126 

- 64 

16 

K 

216 

1 

-16 

103 

-320 

451 

-220 

101 

- 52 

16 

L 

324 

1 

-16 

99 

-292 

403 

-236 

153 

- 64 

16 

M 

2592 

1 

-17 

111 

-338 

451 

-193 

101 

- 52 

16 

N 

432 

1 

-13 

60 

-106 

17 

111 

- 62 

24 

0 

P 

1296 

1 

-18 

129 

-442 

675 

-342 

195 

- 94 

24 

Q 

432 

1 

-19 

144 

-521 

825 

-385 

210 

-107 

28 

R 

72 

1 

- 8 

18 

4 

- 47 

12 

36 

0 

0 

S 

16 

1 

- 8 

18 

4 

- 47 

12 

36 

0 

0 




C. T. Hsu and D. N. Lawley 


245 


The formula for ./c(4 6 ) derived from these results is 


*(4«) = 6,635,520 —^ 1 -J 2) ' - - - {9025# - 145,532ft 7 + 920,610ft 6 

~2,775,248ft 5 + 3,759,853ft 4 -l,717,116ft 3 + 946,872ft 2 
-476,064ft + 136,080}4 2 . 


4. The moments oe the distribution oe & 2 

The formulae of /c(4 2 ), k( 4 3 ) and k( 4 4 ) have been found previously and we will 
give the results: 


a(4 2 ) = 


24ft(ft + l) . i 

(n^'l)(»-2)(»-Sj** 


K(4 n . 1728w(w +1) (ft 2 - 5w + 2) 

M j (ft —l) 2 (ft-2) 2 (ft-3) 2 2> 

6912n(w+1) (53ft 4 -428w 3 + 1025ft 2 -474ft+ 180) 8 
K{ (ft -1 ) 3 (ft - 2) 3 (ft — 3) 3 * 2 


We may now calculate the moments of the distribution of & 4 as follows: 
ft(4 2 ) = at(4 2 ), 

/i(i s ) = k(4 3 ), 

/t(4 4 ) = /r(4 4 ) + 3/c 2 (4 2 ) 

=-+ 207ft 4 - 1707ft 3 + 4105w 2 - 1902ft + 720} k», 

(ft — 1 ) 3 (ft - 2) 3 (ft - 3) 31 

/t(4 5 ) = k( 4 s ) 4- 10/c(4 2 ) k(4 3 ) 

= 7- !?4/' 4W( ?u + / I - Tu { fa7+ i*02ft 6 - 17,516ft f +75,870ft 4 

(ft-1) 4 (ft-2) 4 (ft-3) 41 

- 128,205ft 3 + 69,000ft 2 - 30,492ftl +10,080)4°, 

fi( 4«) = /c(4 8 ) + 15 k( 4 4 ) /c(4 2 ) + 10 n 2 (4 3 ) +15* 3 (4 2 ) 

=- +1 ) { u io + 770ft 9 + 278,359ft 3 - 4,603,808ft 7 

(ft — l) 5 (ft - 2) B (ft - 3) 5 

+ 29,339,665ft 8 - 88,717,430ft 5 + 120,300,577w 4 - 66,075,788 ft 3 
+ 30,365,028ft 2 - 15,250,464ft + 4,354,560} xf. 

The quantity 6 2 is defined to be mjml and, using the definitions of k % and k t given 
in § 1, we see that 

, _ (w-l)(ft-2)(ft-3)4 3(ft-1) 

2 ft 2 (w+l) m\ ft + 1 

_ (w-2) (ft -3) k t 3(ft-1) 

■ ~ (w-i)(ft+i)4 ft+i 
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Thus the average value of b 2 is ‘i(n -1 )j{n+ 1) and, for r > 1, the rth moment of 
the distribution of b 2 is given by 


M b t) = 


(n-2)(n-3) 

{n~l){n+l) 


/h(4 2 ) 


(w-2) (w-3) ' 

,(«-!)(«+1) 


/i(4’‘2'“ r ). 


Now Fisher (1930) has shown that 


/i(... 5°4 6 3«) = /t(... 5 c 4*3 a 2-0 - 4. 


(n-ir 


and in particular 




Thus 


/“r(*a) = 


(w-l) r (n-2) r (n-3) r 


A(4 r ) 


(tt+l) r (n-l)(n+l)...(n + 4r-3) ' 

Hence, finally, we obtain the following results: 

24n(n-2)(n-3) 


Ptlfii) _ 


(w+l) 2 (w + 3)(?H-5)’ 


1728w(n-2)(?r —3)(n 2 -5w + 2) 

2 (w+l) 3 (?i + 3)(n + 5)(?i + 7)(?i-j-9)’ 

1728w(n - 2) (n - 3) { n & + 207n 4 - 1707«, 3 + 4105m, 2 - 1902ra + 720) 
(n+l) 4 (n + 3)(w + 5)(2H-7)(n + 9)(w+ll)(?H-13) 

ji 6 (b 3 ) = t—~ i4 h ~ —r—r=7{5ft 7 + 1402w 6 -17,516n 5 + 75,870 m, 4 
r6V (n+l) 5 (w+3)(n + 5)...(n+17) 1 

- 128,205n 3 + 59,000m, 2 - 30,492m, +10,080}, 

(» + 770»- + 278,359n--4,603,808.' 

+ 29,339,556n e - 88,7 17,430r 6 + 120,380,577w 4 - 55,075,788w 3 
+ 30,365,028w 2 - 15,250,464n + 4,354,560}. 


5. Check with McKay’s results in the case n ~ 4 

Since the formulae which we have obtained are somewhat complicated, it is 
desirable to provide some sort of check on them. This is supplied by using the 
results obtained by McKay (1933), who found in an exact form the distribution 
of b 2 for n = 4, i.e. for samples of size 4. 
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Putting x — 6 2 , the distribution function f(x), of x, is given by 
fw = whenl<a?< 2 , 


2^(7~ 3a:)^\ 5 ’ 2 ’ *’ 2 
3 

2(9#— 17) 




when 2 < x < I, 


where 


9a?—17 
(7 — 3a?) 1 * 


The moments of w = |- 6 2 - A about the origin are given by 


Hence 


v k {w) = 




|p(^+y) P(& +1)+ 


lkr{k-l)r(k + 2 ) 

2 4(1 !) 2 


1 3h(k-l)r(k-§)r(k + 3) 

+ 2'2 4 2 (2 !) 2 + 



vM = 


1 

3.7 


21 ’ 


247 


= 


61 

5.7.11.13’ 


p 4 (w) = 


277 

5.7.11.13.17’ 



79 

3.7.13.17.19’ 


46,889 

~ 5.7.11.13.17.19.23' 

The moments about the mean of w are given by 

M w ) ~ 3 > 5 2_ 7 = 3 , 52 , 7 ’ 

, , ^ -48 -3.2 4 

“ 5 2 .7,11.13 — 5 2 .7.11.13’ 

1424 2 4 .89 

™ W ' ~ 5 4 .7.11.13.17 _ 5 4 .7.11.13.17’ 

, N -42,624 — 2 7 .3.37 

/W ~ 55 3 17 19 ” 5 s . 17.19 ’ 

76,096 2 6 .1189 

5 6 .7.11.13.17.19.23 " 5 5 .7.11.13.17.19.23' 
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* 

Thus for the moments of b t , we have 


2 6 

fafiil “ ‘MzMg 'gz ij * 


_2iO t 3 

/4I 6 *) = 35 ^7103* 


/‘A)’ 4 W 1# t = 5 ( t 7 illil 3 il7 > 



-2 l7 .3.37 
5 f .7,11.13.17.19' 


/»eM = 4 V«M = 


2 18 .1189 

5V7.11.13,17.19.23' 


It may be easily verified that the same results are obtained by putting n~ 4 
in the formulae given at the end of the preceding section, There is thus reason to 
believe that these formulae are in fact correct, 
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TESTING THE HOMOGENEITY OE A SET OF VARIANCES 

By H. 0. HARTLEY 


1 . Introduction 

When analysing data the experimenter is frequently faced with the necessity 
of testing the homogeneity in a set of estimated variances. When it is desired to 
combine a number of variances to obtain an estimate of the common variance 
it is necessary to apply such a test. Again, if a selected “treatment mean square” 
is to be compared with an “error mean square”, a test for homogeneity has 
recently been proposed (Wishart, 1938) as a safeguard against the selection of 
the largest mean square from a set of random ones. 

For general use in such cases Neyman & Pearson (1931) have suggested a test: 
the L x test. The statistic L x used in this test has been modified by Bartlett (1937) 
and generalized by Welch (1935, 1936). From recent work (Nair, 1938; Bishop 
& Nair, 1939; Pitman, 1939) it would appear that Bartlett’s statistic ji is the 
best to use, because it is unbiased in the sense defined by Neyman & Pearson 
(1936, 1938). Or more precisely, the L x test in its original form is biased with 
regard to the admissible set of alternatives. 

Some difficulty has been experienced in obtaining the random sampling 
distribution of this statistic which is required for a test. Various approximations 
have been worked out. There are 

(a) Bartlett’s (1937) approximation using the y 2 distribution. 

(b) P. P. N. Nayer’s (1936) approximation obtained by fitting Pearson-type 
curves to the distribution in the special case where all mean squares are based 
on the same number of degrees of freedom. 

(c) U. S. Nair’s (1938) expansion of the exact distribution in the special case 
mentioned in ( b ). 

(d) Recently another paper on the subject has appeared by E. J. G. Pitman 
(1939). In this paper the author transforms the distribution of L x into a multiple 
integral which can be evaluated in special oases (small values of k) by reduction 
to elliptic integrals. 

The accuracy of the approximations (a) and ( b ) has recently been tested 
(Bishop & Nair, 1939) in the special case in which the expansion (c) is available. 
Bartlett’s findings were confirmed; it was shown that his approximation is valid 
only for moderate or large numbers of degrees of freedom (^ 3), We shall also 
show in this paper that even with this restriction for the degrees of freedom the 
approximation is not very accurate if k, the number of mean squares in the set, 
is large. 
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While U. S. Nair’s expansion, although it is very complicated, provides 
a means of working out the exact probability integral in the special case where 
all mean squares are based on the same number of degrees of freedom, there is 
still uncertainty in the general case. P. P. N. Nayer has suggested that the test 
for homogeneity between k mean squares with/* degrees of freedom (t = 1 ,2 , ..., k) 
is (under certain conditions) identical with testing the homogeneity between k 
mean squares all of which have f degrees of freedom, where/is the arithmetic mean 
of the f t . We shall show that, although there is some truth in this statement, 
the harmonic mean should be used for/rather than the arithmetic mean. 

Since Bartlett’s approximation does not provide a test of sufficient accuracy 
in all cases, the main difficulty of dealing with the general case has been the 
large number of quantities on which the exact distribution depends: if the k mean 
squares in the set have / degrees of freedom respectively ({= 1,2, the 
distribution would depend on k+1 quantities. We shall now show in this paper 
that (provided/js 2) there is an approximation of sufficient accuracy which 
depends on three quantities only. These three are: 

(i) k, the number of mean squares in the set; 

(ii) cq = t v-i» where F = £/; 

<=i/i J (=i 

This makes the distribution amenable to tabulation, so that the test can be re¬ 
duced to an inspection of a table of 5 % and 1 % points which can easily be carried 
out by the experimenter. 

In the case where mean squares having one degree of freedom occur in the 
set, the distribution is of a more complicated character, but our approximation 
is still fair. 

2. The formal solution 

Consider k normal populations with variances a] (t= 1,2,..., k). Letbe an 
unbiased estimate of o\ based on / degrees of freedom, and let us denote by F 
the total number of degrees of freedom, 

-F=£Z. .(i) 

<=i 

/ 

Bartlett’s statistic ft is then given by 

-2 log/* = F log {SIM) IF}- S/log si .(2) 

t t 

The equivalence to a special case of the generalized L x statistic (Welch, 1935, 
1936) is expressed by the relation 


.Flogl/ = 2 logyt, 


.( 3 ) 
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where 


«-hi$rmr- 


.(4) 


For our test we require the random sampling distribution <f>{L x ) of the statistic L[ 
under the null hypothesis 

<A = <r 2 , t=l,2,...,k 

Under these conditions it has recently been shown (Welch, 1936) that the (q -1 )th 
sampling moment of L[ is given by 




£ mr r{hF) 

r(iF+q-l) t i\\ 


n .w 


From general principles it may now be inferred that equation (5) is valid for 
all complex q with 

Real Part of q > 1. 

Further, by Mellin’s inversion formula, we obtain from (5) 


<HL[) - 1W) n mt 1 

t-1 


1 

2ni 


•Q+ico k 

n 

Q—i«> (=1 


qpym-DVi'' 

I) 


r 


f {l +L F~)_ 


L'f' 1 dq 


rQF+q-iy 


.( 6 ) 


where Q (> 1) is an arbitrary positive quantity. 
Introducing as a new statistic 

x = ~F\ogL[ = -21og ji, 

and as a new variable of integration 

\ = l+[q-l)IF, 

we obtain for the distribution function of x [ijr(%) say) 
k lF\-V‘ 

ir(x) = rmn[j ) mt 1 *-** 

l M+ico £ I /Ji\A/, 


■(V 


X A-in <Pl IV/, 


Wl) 


oX A 


F(F A) 


dA, .(9) 


where A is an arbitrary positive quantity. 

Using now Binet’s integral representation of log/ 1 (Whittaker & Watson, 
1927, p. 249), we may write equation (9) in the form 


where 




o \2 r^e T -l/r L(= 1 


/M+t» 

X-l(k-l) e x\ e EW d x, , 

J A-ia 

....(10) 

^ e -rM_ e - T 

....(11) 










.(12) 
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Introducing g(r) = (g ■- ~ \, 

we see that g[r) has continuous derivatives of any order for 0 < r ^ oo. We may 
therefore transform the integral (11) by integration by parts, differentiating 
g(r) and integrating the exponential functions. We obtain 


m 


1 

r * 

v l 

fiv 

1" 

1 

'y(-) 

1 " 

12 A 

L«=i 

l/J 

- 1_ 

360A 3 




+ 


A 3 


rd*g(r)r 

■ ft /0-rA/A 


Jo dr 3 L 

.A\ ft 1 

J3 


.(13) 


We now approximate to E( A), and therefore to f(x), by ignoring the last summand 
inequation (13), and write 

. < 14 > 

* /1\ 1 k i 1 \ 1 

where and c 3 = 2^)-^.( 15 ) 

It can be shown that this approximation is sufficient for all practical purposes 
provided 

ft> 2, t = 1,2,...,*. 

Substituting (14) in (10), expanding e E(A) and integrating the single terms 
we obtain* 


00 / 7 -j \ 

ijr(x) £ 2 _i(/c ' _1 )e“^ Cl+, ' lCj x Sa i 2 _i / 1 (- hi) x i( - k ~® +i e~ ix , 

i-o \ 2 / 


..(16) 

..(17) 


where the a* are the coefficients of the expansion 

g(F c d~ AM 8 

in ascending powers of t. 

Prom (16) it is obvious that (to the degree of accuracy considered) the distribu¬ 
tion of x is a weighted sum of x 2 distributions with degrees of freedom ranging 
between k-1 and oo. We now denote by Pj{X) the probability integral of y 2 
based on j degrees of freedom, i.e. we introduce 


P,( X) = 2-M J* xW-*>e~'*dx. 


.(18) 


We further denote by P(X) the probability integral of our variate x defined 
in (7), viz. fa) 

P(X) = j f(x)dx. .(19) 

* We make use of the well-known integral representation of 1 /JT(»), viz. 


(m- 


A+iai 


.1 [ , 

2 nj A-ia> 


e p p~ z dp. 
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From equation (16) we obtain by integration 

P{X) = 2 o'*iP k _ 1+ii (X) x ( > .(20) 

1=0 \ i =0 / 

where the oc^ are the coefficients of the expansion of 

gS c i i_ /2j\ 

in ascending powers of t. 


3. Tabulation of percentage points 

Equation (20) provides a means of calculating tables of the probability 
integral P(X) (or its 5 % and 1 % points). For the quantities P } (X) are given 
by Elderton’s tables of the probability integral of y 2 , while the coefficients 
are readily obtained from the expansion of (21). 

For practical purposes tables of the 5 % and 1 % points could be prepared. 
These percentage points would depend on three quantities, viz. 


K 


h = 





F 3 ' 


The effect of c 3 is small, and it would be convenient to make 1c and c x the respective 
row and column headings of two-way tables of percentage points, and to prepare 
such tables for two or three selected values of c 3 . It is hoped to prepare such tables 
shortly. 


4. Comparison with U. S. Nair’s expansion 
It would lead us too far afield if we gave here a complete mathematical proof 
of the accuracy of the approximation (20). It is, however, of interest to check 
the accuracy in a few cases numerically. U. S. Nair’s expansion mentioned 
above will be used for this check. The most stringent test of the accuracy of 
equation (20) is established by choosing the / ( small and h large. Numerical 
results have been obtained from U. S. Nair’s expansion (2) in the case/, = / = 2; 
h — 10. The result of the test is given below. 


Lower percentage points of L[ = e~ x l F 



5% point 

1 % point 

(a) Bartlett’s approximation 
(c) U. >S. Nair’s expansion 

0-367 

0-277 

0-376 

0-288 

(d) Equation (20) 

0-378 

0-291 


The agreement between U. S. Nair’s expansion (c) and equation (20), ( d ) is 
satisfactory in this case where the approximation would be expected to be worst. 
For comparison, Bartlett’s approximation (a) is also shown. 
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Testing the Homogeneity of a set of Variances 

5 . The relation between the special case /,=/, /= 1,2,..., k 

AND THE GENERAL CASE 

P. P. N. Nayer has considered the general case, and has provided some evidence 
for believing that this case can be reduced to the special case provided that 
the f t are not too small and not too dissimilar in value. He has suggested 
using the mean of the f t as a substitute for the common value/. It is easy to see 
from the approximation ( 20 ) that there is some truth in Mayer’s conjecture. 
However, it is not correct to use the arithmetic mean. The correct value is given by 



and is approximately equal to the harmonic mean of the f t . If all / > 4, the general 
case of unequal f t can cthoays be reduced to the special case f t = /, no matter 
how dissimilar the/. For if in equation (21) we replace c 3 by and consider 
the function 

we find that the coefficients of this function when expanded in ascending powers 
of t will be approximations of sufficient accuracy to the coefficients a { in ( 20 ). 
The probability integral of X is therefore determined by the quantities k and c l5 
so that the identity of the general case and the special case is obvious, provided/ 
is defined by ( 22 ). 

6. Some remarks on Bartlett’s approximation 
Bartlett (1937) has given an approximation to the distribution of 

- 2 log /i - x. 

He suggests as an approximate test that we enter the table of y 2 for k-i degrees 
of freedom with the statistic 

3 x(k- 1 )/c v 

where c x is given by (15). 

It can be shown that this approximation is equivalent to equation ( 20 ) 
provided is small, so that higher order terms in the expansion ( 20 ) may be 
ignored. For large or moderate values of Jc x , however, discrepancies may occur 
even if all mean squares are based on moderate or large numbers of degrees of 
freedom. We shall confine ourselves here to demonstrating this with the help 
of a single example, viz. / = 5 and k - 30. While for values of k of this order 
U. S. Nair’s expansion is very complicated, equation ( 20 ) yields results which are 
accurate to 3 figures. Below are given the probabilities of exceeding Bartlett’s 
5 % and 1 % values; they are 

6% level 1% level 
True P{X) 0'047 0-0081 

Thus Bartlett’s approximation has an error of 6 % and 19 % respectively. 
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THE SIMULTANEOUS DISTRIBUTION IN SAMPLES OF 
MEAN AND STANDARD DEVIATION, AND OE MEAN 

AND VARIANCE 


By L. TRUKSA 
The Charles University , Prague 


In this study I propose to give the application of “ the conception of the probability 
of passage” to the solution of the rather difficult general problem mentioned 
above. From this single example it is possible to deduce that the introduction of 
“a conception of the probability of passage” into mathematical statistics would 
at least make the solution of a range of difficult problems considerably easier. 

Let us assume that the class symbol of the statistical element is a continuous 
two-dimensional variable x, y, defined in the region Q, and that the corresponding 
density of the probability of passage from the class x v y x into the class x, y 
expressed by the symbol p t {x v y x \ x,y), depends, not only on the variables x,y 
but also on the discontinuous variable t, which is the number of operations 
executed on the statistical element. By operation we shall mean in our case the 
selection of a statistical element from the fundamental universe. 

Let the function pfa, y x \ x, y) satisfy the following fundamental relationship: 



(U 


A further relationship, which will be used, concerns the calculations of the 
continuous two-dimensional probability distribution F l+l (x,y), corresponding to 
the number of operations f+1, from the distribution F(x x ,y x ) by means of 

P p 


U x >v) 


F KyiM*vyv*>y)fai&yv 


( 2 ) 


The problem of the simultaneous distribution of the mean and standard 
deviation of samples in the case in which the fundamental distribution is given 
quite generally by a function/(x), has, so far as I know, occupied the attention 
of only one man; this was A. T. Craig (1932) in his study: “The simultaneous 
distribution of mean and standard deviation in small samples.” He introduces 
the solution only for samples of a very small number of items, n = 2,3,4. 

The use of the conception of the probability of passage enables us to demon¬ 
strate successively the solution of this problem for samples with an increasing 
number of items. The method used gives us at the same time a solution in a very 
easy manner, and especially clear, if used with a graphical illustration. 
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I 

Let us use x for the mean of a random sample of the symbols x 1 x 2 ... x t taken 
from the continuous one-dimensional universe of density f(x), where we have 

1 1 

* = (3) 

t 1 

and for the standard deviation s, which, in accordance with the definition of the 
standard deviation, is given by 

s 2 = 7 Sfo-*) 2 . (3-1) 

t i 

By an extension of this sample of size t to the size t + 2, we get a sample, the mean 
of which is 

(M) 

and the standard deviation S is given by 

S 2 =^|(*ri) 2 - (3-3) 

The elementary probability of passage p t {x,s; X, S) from the sample with x 
as mean and standard deviation s to the sample of mean X and standard deviation 
S equals the probability of the appearance of the values x l+i and x (Vi ; thus 

/(*<+ i) dx l+1 f{x l+2 ) dx M = p t {x, s;X,S)dXdS. (4) 

In this expression it is necessary to substitute for the variables x l+1> x t+i in 
terms of the variables X, S\ the values x, sin substituting being taken as constants. 

From the expressions (3) and (3-2) we obtain, first of all, the following relation¬ 
ship: 

%i + %2 = X{l + 2)-xt. (5) 

A further relationship which we obtain from equations (3T) and (3-3) is 

(t + 2) S* = S (x t -Xf + (x t+1 -1) 2 + (x M -Xf 

l 

= Ix\ — 2tXx-{- (t + 2) X 2 +xf +l + xl\. 2 — 2{x l+1 + £ty +2 ) A. 
i 


If we now use the relationship 

fs 2 = Ix\-tx i ) 


we obtain another equation, 
x t+1 and x i+2> i.e. 

xf n + x 


which is necessary for . the determination of the values 
= (t + 2)(S* + X*)-t(s*+x*). (5-1) 
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From equations (5) and (5-1) it then further follows that 

% x V{2[(«+ 2) £ 2 - ts 2 ] - t(t + 2) (X - *) 2 } = ~ X - y ± 

- 

x M = X - lx +\ V{2[(«+ 2) S *- ts *J - t(t + 2) (I - x)*} =t^x-~x + l a> 

(5-2) 

where, for the sake of brevity, we introduce the symbol 

a = V{2 [(< + 2) 8 * - te 2 ] - i(f + 2) (X - z) 2 }. 

For effecting the substitution in expression (4) it is also necessary to know 
the corresponding determinant of the substitution 

% 2 ) = __ (t + 2)*S 

D(X, 8) 2 f{2[(t + 2 )S*~ is 2 ] - t{t + 2) (X - ir) 2 } * 

Referring to the two different values for each of the symbols x l+l and % M 
we then get the expression for the density of probability of passage p t (x, a; X, S) 
in the form 

Let Ffx, s) be the density of the simultaneous distribution of mean ie and 
standard deviation s in the random sample of size t. 

The density of the distribution of mean X and standard deviation S in the 
random sample of size i + 2 is obtained by application of the fundamental relation¬ 
ship (2), and for its value we get the following expression: 

F t+i {X, 8) = s)p ( {x, s; X, S) dxds 

= 2 ^ +2)2 Ll^ ,s)/ (^ 2l "^ +ia 

(7) 

For the complete solution of this recurrence relationship, besides the limits of 
integration (which we shall consider later on), we need to know the initial values 
of the function F t (X, 8), i.e, 

F l {X,S)=f(X), 1 

F 2 (X,S) = p a (x,s‘ ! X,8) = 4 f(X + S)f(X~8).) ( ' ) 

Let us supplement these values with the following function F S (X, S) for which 
only one integration is necessary: 

F,(X,S) - X8sf/(i)/[}2-J*+W(6#-3(r-s)>}] 

the limits of integration will be deduced later on in this study. 
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In order to examine the limits of integration, 

( 1 ) Let us first of all assume that the density of the fundamental universe is 
expressed by the function f(x), defined between the limits ± oo. 

In this case we set only one condition on the values x t+1 , x l+i , that is that they 
must be real. The condition corresponding to this is expressed by the inequality 
2{t + 2 ) S 2 - 2 te 2 - t{t + 2 ) (1 - xf> 0 . 

If we consider X, 8 as constants and x, s as variables and we use rectangular 
co-ordinates with axes x, s, these variables are limited to the region given by that 
half of the ellipse E n : f ,9 # , 9 

which lies above the axis x. According to the choice of the values X, S, the variable 
x varies between the limits + co; the variable s then lies in the range 0 , co. 



0 X 

The centre of the ellipse E 0 lies on the axis of a; at a distance X from the origin 
of the co-ordinates; the semi-axes have the lengths 



The integration of expression (7) with respect to x must therefore be carried 
out between the limits 



and then with respect to s between the limits 



In Diagram 1 the surface of integration is shown for the particular values X = 1, 
S = T5, t = 4. 
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7 2 

-, the semi-major 

axis A a being always larger than A^, their ratio 

A, It+ 2 


A, 


jt+_ 

: V 2 


depending only on the size of the sample. 

If calculation of the value of F 3 (X, 8), is involved, i.e. if t = 1 , the ellipse E 0 
reduces to that part of the x axis with X±S*J 2 as end-points. The corresponding 
integration need only be carried out with respect to x, and that between the 

liraits X±SJ. 2. 

Formula (7) can be used, not only for the successive calculation of the simul¬ 
taneous distribution of the mean and standard deviation for an increasing size 
of random samples, but also for the verification of the given distribution values 
introduced for the arbitrary t. 

Thus, for example, it is possible to check the correctness of the expression 




G 


< 

2 c 2 


l-1 

T 8 l ~ 2 «* 

e 2e 3 


which corresponds to the normal universe 

/(*) = ‘ 


-It) 


c*j2n 

By application of formula (7) we obtain 


e 2<”. 


-5 


(-i 

t\~rtt+ a\* 


7T 


8 


(M- 2)(X 3 +,S’) 

e 2d 3 


c l +*r\ 


(¥) 

r + V(fST2) 


(-1 

t\~T It+ 2\* 

7t 


s { ~ 2 ds 


S 


dx 


? *\a/( 


S 2 s 2 


1 t+2 




(V) 


(t+8)(2C*+S*) 
e 2d 3 


,sV¥ 


c'+ 2 r( 

s^ds 


n /2 


arc sm 


i+i 


X 2 _sM 

•* i + 2 '^ + V(f-iTf) 


V(t+ 2 ) S ‘ 




l 2 c 2 / Jt+l 


(t+2)ffl 
1 e~ 2 e 3 
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As can be seen, the function JJ(Z, S) satisfies the fundamental recurrence 
relationship (7). 

( 2 ) Let the fundamental universe be defined by a continuous function defined in 
the range 0 , oo. 

Under this assumption the values x t+v x t+2 > must satisfy, apart from the 
condition of their being real, another condition expressed by the inequality 
l j _2 — t 

— X ~ \ V{2(< + 2) & - 2 ts 2 - t{t + 2 ) (X - *)»} > 0 ; 

the mean of the sample x cannot then assume negative values, which gives 

x^O. 

The corresponding values of the variables x, s, satisfying the first inequality 
lie outside the ellipse E x 

Z Z 2 Z 

The centre of the ellipse is situated on the x axis at a distance X(t + 2)/(t+ 1 ) 
from the origin, the semi-axes have the lengths 


K = 


7 + 2 


£ 2 - 


Xfi_ 
Hi)' 


a; = 


i + 2 

i(t +1) 


fy-li 

\ (+i 


From the quotient -A = y(i + 1 ) it follows that the semi-axis A' is longer than 
A* 

Aj. The ellipse E x intersects the x axis at the points 


<+1 


t + 2 
f(*+l) 


S 2 - 


Z 2 

't+1 


The ellipse JBj lies inside the ellipse E 0 and touches it at the point x v s l5 where 


- _ 2 
X x — Z , 


7+2 

i 


S 2 - 


?) 




as long as 

From the expression for the lengths of the semi-axes it follows that Zj is real 
only as long as the condition 


8* 


X 


*J{H 1 ) 

is satisfied. 

Considering the condition that the mean x cannot attain negative values, we 
obtain the inequality $ ^ X J(t + 1 ). 

The system of the ellipses E x is therefore real for those points with the para¬ 
meters Z, 8 as co-ordinates, which lie between the straight lines 

S -Wn)’ S = 


and that for Z 0 , 8^ 0 . 
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When carrying out the integration indicated in formula (7), it is also necessary 
to distinguish between the cases when the ellipse E 0 intersects the negative part 
of the x axis and when it does not, i.e. if 

Sixjt or Stxji. 

In Diagram 2, the corresponding regions of the values X, S are shown fori = 8. 



The surface of integration is that part of the plane x, s in the first quadrant 
contained between the ellipses E 0 and E v 
The process of integration is then as follows: 

(ft) Let S satisfy the inequality 

0 


X 

1 ) 


In this case only the ellipse E 0 is real, It is therefore necessary to carry out the 

— II2S 2 2s 2 ' 

integration with respect to x in between the limits X ± 

It+ 2 

respect to s between the limits 0, S 


t t + 2 


and with 


In order to illustrate the theory, formula (7) will be applied to find the 
analytical expression of the correlation surface F t (X, S) for samples from the 
fundamental universe = e ~ x . O<x<00 


over the region in the plane X , S bounded by the inequality mentioned above, 
For t = 2 and 3 the results are given in A. T. Craig’s paper, namely 


F a [X, S) = 4e _2X ; F Z (X, 8) = 6^/3 tt 8^ x . 
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By application of our formula ( 7 ) we obtain 

Fi(%, S) = UnSH-^ ; J 5 (1, 8) = 50 a/5 n 2 S s e- &x . 

Regarding these expressions let us assume the general solution to be 

F t+S = H t+2 8 l e~^. 

A recurrence relationship for the coefficients H t may be easily found from the 
equation 

f +2 

F l+2 {X, 8) = 2S(t + 3) 2 J^ = HfiShtvW* ^ 


= 5, +a ^e-W, 


•^+2 — H^TT 


t+l 

(i + 2)T 


(«-l)< 2 

It is then not difficult to verify that for t ^ 2 


5 


«+i t 
2ir 2 0 


t+i « 


The function 


O 7 r 


(¥) 


flMe-ur, t> 2 . 


represents a part of the whole distribution given by the integral 


t-i 


/V(t+i) _ 2v 2 Ht) 

F t (X,S)dXdS = T 
=0 


coo <V(t 

Jx=oJsM 


* l±i _ 7 \ 

< 2 (t-i) 2 


, 2, 


that is, if t = 2 3 

100 % c.60% 

of the whole distribution. 

( 6 ) If $ satisfies the inequality 

X 


4 

c. 30 % 


5 

c. 13 % 




V(*+!) 

the relative position of the ellipses 25 0 and E 1 is shown in Diagrams 3a, 6 . The 
integral in formula (7) is equal to the sum of the integrals between the following 
limits: 

(a) According to x: 


X- 


2$ 2 2 s 2 


t t + 2 


2 s 2 \ _< + 2 / 

2 + 2,!’ HI V 


2 + 2 

2 ( 2 + 1 ) 


£ 2 -s 2 


( + 2 2 + 1 / 
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o 
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According to s : 



(yd) According to x\ 



According to s: 




(y) According to a:: 

~ty 2 I t-\-2 
t -\-1 sj t(t +1) 


S 2 - 


t + 2 




According to s: 



(c) Finally if X 


S^X*J(t+ 1), part of the ellipse E 0 falls to the left of 

Jj 


the s axis, E t always being to the right of the s axis, the surface of integration of 
the values x, s is made up of two parts, as can be seen in Diagram 4. 

For the purpose of integration it is necessary to divide the integration into 
four parts: a, /?, y, S . The integral of the expression given in (7) is equal to the sum 
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of the integrals with the following limits for the value x, and for s, x to be taken 
first of all: 


(a) The limits of a:: 

0; X 


The limits of s: 

(/?) The limits of x; 

/2/S 2 2s 2 


t + 2 
i +1 

0; 


t -|- 2 

t{T+i) 
t + 2 



Z 2 \ 

b i + 2 

t +1/ 




X ' *' { i + 2 
The limits of s: 

It + 2 


f + 2 - 


+ 2 I t + 2 


i+1 


/S 2 - 


t 

(y) The limits of »: 

1- 

The limits of a: 


i + 2 
/S 2 -~Z 2 

<u 


i(£ +1) 


sH 

Z 2 \ 

^ + 2 

i+ ]/ 


t + 2 

r * 


2S* 2j ! 

“(T2 1 ’ A + 


2/S 2 

t 


2s 2 \ 

i + 2 J 


t + 2 

F 


/S' 2 - 


_Z 2 
1 + 




/S 


't + 2 


(d) The limits of a;: 

t + 2 


F+ 2 i 

2 rn + 


t{t+i) 

The limits of s: 


S 2 - 


o; 


s 2 i Z 2 
{ + 2 t +1/ 
’( + 2 
t 


Z + 


2/S 2 2s 2 
t t + 2 




If the calculation of function i^Z, /S) is especially required, that means, if 
t ~ 1, that there is a substantial simplification in the integration. First of all it is 
necessary to carry out the integration only according to x; besides that it is 
sufficient to differentiate between two cases only, according to whether the 
value S satisfies f X 

0</S<-^ or j 2 $S$Xf 2, 

X _ n 

since obviously the case of- ' " ” 


< /S < X - drops out of consideration. 


SJH 1 ) 

If the first of the given inequalities is valid, it is necessary to carry out the 
integration with respect to x between the limits: 

X-Sf 2; X + Sf 2; in the case of the validity of the second inequality, the 
corresponding limits of integration are: 

0; |l-|V(6^ 2 -3l 2 ), 
f I + £V(6S 2 -3X 2 ); X + S<J 2. 

These results agree with the results quoted in the paper by A. T. Craig 
already cited. 
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(3) Let the fundamental universe be defined in the range 0, a, 

The values x l+x , x l+2 , in this case are determined by the inequality 

6 x i+i> x (+i, ^ ®) 

the mean of the sample satisfies the inequality 

From the upper limit of the values % 1( x l+i it follows that 
t + 2 - t 

~X--x +1 V{2(« + 2 )S*~ 2ts 2 - t(i + 2)(X- S) 2 } < a. 

The corresponding values of the variables x, s, which satisfy this inequality, 
lie outside the ellipse E 2 : 


t -^^x 2 +tx(a-t+2X)+ls 2 = i+gffl- ( < + 1 )( t + 2 ) x*+a(t+2)X-a\ 

L 2 t 2 

Its centre lies on the x axis and is at a distance from the origin equal to 

y« + 2 a 

A t+l t + V 

this distance is always smaller than the distance of the centre of the ellipse E 0 
from the origin. 

The lengths of the semi-axes are 


AI = 


t + 2 

f(i+T) 


s 2 - 


t +1 


a: 


V(*+i)‘ 


The ellipse E t lies inside the ellipse E 0 , and touches it at the point where 


x 9 — X 


t + 2 2 a 

T 


2a lt + 2 

~T’ s * = /J— 


£ 2 -~(a-X ) 2 

t 


as long as 


S>(a-X) J- t . 


If the ellipse E 2 is to be real, then we must have 

a-X 


S2 


V(*+i)' 


From the upper limit a of the mean x of the sample it follows that 

t 2 


yT + 2 a 
A “ - 7“ r “T 


t(t + l) 




1 4 * 1 t~\~ 1 
This inequality leads to the relation 

S^(a-X)J(t+ 1). 

Finally, it is necessary to consider the condition that the ellipses E x and E 2 
must not intersect, in the limiting case they touch at the point 

, t ~f* 2 a 


x = X- 


t t’ 


t+2 0 ^ t+2 = t + 1 

— S* + 2Z -^-{a - X) - a 2 -j- 


0. 
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From this condition there follows the limitation of the permissible values of X, 3 

by the hyperbola H f aX t + 1 

S+2Z—a^j-0. 

For carrying out the integration in formula (7) it is necessary to know the 
condition for the ellipse E 0 to intersect the x axis at a distance A from the origin, i.e. 

J~r 


Diagram 5 



The whole process of integration is quite clearly shown in the graphical 
representation of the individual segments of the permissible values X, S in Dia¬ 
grams 5-7, for t = 1,2 and 3, It is convenient to split up the surface of the values 
X, 8, for which both the ellipses E x and E z exist, into two parts by a straight line 
X " ^ct. 

If X<£a, 

the semi-axis A' s of the ellipse E x is longer than that of the ellipse E 2 , 

If X>K 

the ratio of the lengths of the semi-axes is the reciprocal. 

The description of the limits of integration for x, s is lengthy in tabular form 
and is better given by means of diagrams. 

In the special case, t = 1, the integration is carried out only according to x, 
for the values X, 8 (see Diagram 6) in the segment: 

(a) Between the limits; X-8f2; X + SJ2. 

(ft) Between the limits: 

0; fX-^(6S 2 -3X 2 ) and fX + ^^-gX 2 ); X + 8^2. 
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( 7 ) Between the limits: 

I-SV2; —and ~^ + iV{«-8 > -3(a-jC)*}; a. 
(5) Between the limits: 

°! (a*) 2 } and ‘^p + l^-‘i(aX)^, 

3X 

~2 2 V {®^ 2 ~ 3 A 2 } and f X + 11 \/{ 6 /S 2 — 3X 2 }; ft. 



As a simple illustration let us find the correlation surface F t (X, S) of samples 
of t items drawn from the distribution 

/(*) = S 0<a<« 

0/ 

over the region bounded by the straight lines 

s = 0, S-JL-, S-” - * 


V((-i)’ #-i)' 

Using the results given by A. T. Craig, 

«x»)=4 

U/ W* 

we find, by our formula (7), 
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and then by similar reasoning as in example on page 262 we obtain 

t~i i 


m>s) 


27T 2 i 2 



/S'- 2 . 


This function is independent of the variable X. 

As to statistical theory, the problem of the correlation surface F t (X, S) is 
solved by our formula (7) for any fundamental population f(x), but as to applica¬ 
tion on a special distribution f(x), I have not overcome all the difficulties of 



integration. Nevertheless, I feel, the approach being a new one, this study may be 
of interest to statisticians and I hope perhaps that some mathematician will see 
how to solve the problems that I have left uncompleted. 


II 

Let us take the density of the simultaneous distribution of a mean * and 
variance s 2 = u, when the size of the sample is t, as G t (x, u). By an application of 
the same process as in Part I we get the fundamental recurrence relationship 
for the successive calculation of the values G t (x, u): 


G t+i (X, U) = (H2) 2 j^ + 


\dxdu 


a 


( 8 ) 



L. Texj-ksa 


271 


The corresponding initial values are expressed by the following relationship: 

e a (i,tf) = 4/(X+VW-Vtf)'f <8 ' I) 

For the calculation of the next value 0 3 (X, U) we need to carry out only one 
simple integration: 

0 3 (X, U) = $jf(x)f[§X-& + y{W-2u-<>(X-x)*}] 

x/B 

The limits of integration for the different ranges of the fundamental universe 
can be deduced by the method given in Part I. At the same time the three ellipses 
E 0 , E v E 2 are replaced by the three parabolae, P 0 , P x and P 2 : 


t- f- 2 

~ 


{x-Xf + u 


t + 2 


U, 


t(t+ 1) =2 < + 2 (t + l)(i + 2) = 

—-—x -t(t + 2)xX + -u- -g- U - - 

t(t+ 1)_ 1 +' 2 tt (*+l)(f + 2) 

^-^x*-tx(a-t + 2X) + -u= — U — -jp— 


-Z 2 -ta(£ + 2)X-a 2 


In conclusion I must express my thanks to Professor E. S. Pearson for advice 
and several useful suggestions. 
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CERTAIN PROJECTIVE DEPTH AND BREADTH 
MEASUREMENTS OF THE FACIAL 
SKELETON IN MAN 

By ALETTE SCHREINER, Oslo 

1. Definitions of the measurements 

In their study of the “flatness” of the facial skeleton in man T, L. Woo & G. M. 
Morant (1934) lay down no method for direct measurement of the 'transverse 
flattening of the middle part of the facial skeleton, i.e. of the part made up by the 
malar bones and corpora of the maxillae, In studying a number of crania with 
differently shaped facial skeletons, it occurred to me that the best expression of the 
degree of projection of the middle part of the facial skeleton might be obtained by 
expressing the projective distances of the zygomaxillary and zygotemporal points 
from the most posterior points on the margins of the pyriform aperture as per¬ 
centages of the lengths of the chords between the corresponding bilateral points. 

It is true that no precise points of general validity for this purpose can be 
indicated on the “nasolateral” margins. It is also the case that we often find 
asymmetry here, though hardly more than in other parts of the facial skeleton 
on which routine measurements are taken. However, the advantages offered by 
the most posterior points on the margins for subtense measurements appear to 
me to outweigh the disadvantages. It is a factor of some importance that the two 
subtense planes in question are almost coincident and approximately horizontal. 

After having taken some test measurements with a pair of ordinary co-ordinate 
callipers, I came to the conclusion that this instrument was unsuitable for my 
purpose. I therefore decided to undertake a preliminary research with the object 
of testing the value of the method, disregarding the disturbing factor of asymmetry. 
For this purpose I designed, and ordered from P. Hermann of Zurich, a special 
pair of callipers with two parallel arms, both of which might be moved in directions 
at right angles to the bar to give readings of the distances of the tips from the bar, 
According to my design the tips were to be slightly blunt, and the bar was to be 
only 2 mm, thick with blunt edges on its working-face. Owing to the trouble and 
expense involved in the construction of an instrument of this kind, P. Hermann 
sent for my approval one of a set of twelve pairs of callipers which he had made some 
years ago to the order of R. Pooh, some of which were still in his possession (Fig. 1). 
In most respects the design is similar to my own, but the bar is 3 mm. thick and the 
edges of the working-face are sharp, Furthermore, the scales of the arms are fixed 
in such a manner as to necessitate correction of the readings of the subtenses, 
whereby 4-3 mm. (controlled by the Weights and Measures Standards Office, 
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Oslo) must be added to the readings. In spite of these drawbacks I decided to keep 
the instrument, and; though it is not ideal for the purpose, it can be used effectively. 
I do not know for what kind of measuring it was originally designed. 

For measuring a chord and its subtense I first set both arms of the instrument 
at equal lengths, longer than the subtense, and place the tips in contact with the 
extremities of the transverse chord. I then fasten the screw, draw back the arms, 
place the working-face of the bar lightly but firmly against the nasolateral margins, 
and move the arms until the tips again meet the bilateral points. On removing the 
instrument the distance between the arms and the subtenses are recorded, the 
latter after addition of 4-3 mm. If the readings on the two arms are not equal, as 



Fig. 1. Callipers used to measure the breadths and subtenses. 
(The figures on the scale are centimetres.) 


is most frequently the case, the average of the two values is recorded. For my 
method of measuring, however, I do not regard as suitable skulls with conspicuous 
asymmetry of the facial skeleton. 

With my instrument I also took the measurements which give the “frontal 
index of facial flatness” of Woo & Morant in such a manner that both arms were 
of the same, or-practically the same, length. This frequently necessitated re¬ 
measuring. I do not believe that the length of the subtense obtained by my method 
differs from that arrived at by using a pair of co -ordinate callipers of the usual form. 

My measurements are as follows: 

(i) The chord IOW, inner biorbital breadth, fmo-fmo of Martin (1928). 

(ii) Sub. IOW, the subtense of the nasion from the chord IOW. 

(iii) The chord GB, bimaxillary breadth, zm-zm of Martin. 

W 
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(iv) Sub, GB, the subtense of the nasolateralia ( Nl-Nl ), i.e. the line joining the 
most posterior points on the margins of the pyriform aperture, from the chord GB. 

(v) The chord ZB, bizygotemporal breadth, between the two zygotemporalia 
infmoria (ZT of Woo, 1937). 

(vi) Sub. ZB, the subtense of Nl-Nl from the chord ZB. 

(vii) FB, the bizygomatic breadth, zy-zy of Martin. 

Before passing on to the indices, I must comment on the fact that the location 
of the zygomaxillary point ( zm ) gave me a certain amount of trouble. I did not 
always adhere to Martin’s definition of it, which specifies absolutely the lowest 
point on the zygomatico-maxillary suture. Woo & Morant adopt this definition, 
adding that “if the inferior extremity of the suture is a short length lying parallel 
to the horizontal plane, the anterior point on it is the one accepted What is the 
position, however, if the length is not quite short, and if it does not lie quite parallel 
to the horizontal plane, as when the margin is rough and irregular 1 The fact is 
that we are dealing here with a region of the facial skeleton which reveals a par¬ 
ticularly high degree of variability, due primarily to differences in the mode of 
origin of the anterior part of the masseter muscle. In many cases the origin does 
not reach the maxilla, and where this is so location of the point presents no diffi¬ 
culty; but in other cases, and probably for some races in the majority of cases and 
particularly in males, the origin continues for different lengths on the lower border 
of the maxilla, and it often shows fairly strong impressions there. This last con¬ 
dition was frequently found in the case of the Norwegian skulls which I have 
examined. The same is true for the Eskimo and Australian specimens, but it was 
foundmuch less frequently in the case of the skulls of Lapps and those of some other 
races. There appear to be racial differences in this respect. 

In some cases the lowest point on the suture lies rather far back on a broad 
and rugged surface. Such a point would appear to be quite useless for the purpose 
of measuring the projection of the facial skeleton, and in fact it may not always 
be possible to reach it with the tip of one of the arms of the callipers when the bar 
is placed on the nasolateral margins, In view of this I was compelled to draw up 
a new definition of the zygomaxillary point, viz. the lowest point of the zygo¬ 
matico-maxillary suture which still lies on the anterior surface of the bones. The 
adoption of this definition may occasionally alter the length of the chord GB by 
a small amount, but this is of little significance in comparison with the change 
which it will sometimes make in the length of the subtense. 

The definition of the zygotemporal point also seems rather uncertain. Woo 
(1937) defines' ZT as “the lowest point on the zygomatic suture which is still on 
the lateral surface of the arch”. In some cases, however, it is very difficult to say 
where the lateral surface of the arch ends. The section of the arch shows consider¬ 
able variation in form. Rectangular sections which make possible an absolutely 
precise location of the point are somewhat scarce. More frequently the zygoma 
has a lateral and a latero-inferior surface which are more or less indistinctly separ- 
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ated from each other. In such cases I have located the point to the best of my 
ability on the borderline between the two. In most cases I have found the chord 
to be 2-3 mm. shorter than the greatest possible breadth between the sutures on 
the two sides. Although the question is of much less importance here than it is 
for the zygomaxillary point, a more precise definition is nevertheless desirable. 

Should the method which I suggest for measuring facial projections in relation 
to the nasolateral margins be generally adopted by craniologists, it will be necessary 
for them to find indisputable definitions to give the exact location of the extremities 
of the two chords in question. 



fig. 2. A horizontal section of the facial skeleton illustrating measurements taken. 


The indices used can be divided into two classes, the first (Nos. viii-xi) in¬ 
volving subtenses and the second (Nos. xii-xvi) being ratios of pairs of the trans¬ 
verse breadths. They are: 


(viii) SFi, 

(ix) SMi, 

(x) SZi, 

(xi) SSi, 

(xii) GOi, 
(xiii) ZOi, 

(xiv) GZi, 

(xv) GFi, 

(xvi) ZFi, 


100 Sub. IOW/IOW = 
100 Sub. GB/GB = 

100 Sub. ZBjZB = 

100 Sub. OB/Sub. ZB 
100 GB/IOW 
100 ZBIIOW 
100 GBjZB 
100 GB/FB 
100 ZBjFB 


frontal index of facial flatness, 
maxillary index of facial flatness, 
zygotemporal subtense index, 

maxillo-orbital breadth index, 
zygomatico-orbital breadth index, 
maxillo-zygotemporal breadth index, 
maxillo-facial breadth index, 
zygomatico-facial breadth index. 


Finally, I have calculated approximately the angle between the vertical plane 
through the zygomaxillary and zygotemporal points on either side and the median 
sagittal plane (see Fig. 2). Assuming that the two planes zm-Nl-Nl-zm and 
ZT-Nl-Nl-ZT are horizontal and coplanar, the fraction (ZB-GB)j'2(sub. ZB-sub. 
GB) is the tangent of the “zygomatic angle” {Zl). 
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A metrical description of the curvature of the inferior margin of the malar 
bone would be of interest, but my attempts to measure it have been unsuccessful. 
A survey of such a feature would appear to be possible only by using complicated 
projective methods. 


2. The cranial series measured 

My main material consists of 100 male and 100 female Norwegian skulls from 
medieval churchyards in Oslo, and 100 male and 100 female skulls of Lapps ob¬ 
tained from various cemeteries in the county of Finmark, the majority of these 
being of fairly recent date. Most of the Lapp skulls form part of the material 
already dealt with by K. E. Schreiner (1931-5), but some of them have been 
acquired at a later date by the Anthropological Institute, Oslo. All of these are 
brachycephalic specimens. The Oslo skulls form part of the material which has 
been described by the same author (1939). I have looked through his records and 
have omitted the small number of skulls with a cephalic index greater than 79'9, 
or an upper facial index less than 50-0, 

The series of foreign skulls in the possession of our Institute are all too small, 
and for the most part of too miscellaneous a nature, to provide results of any 
importance. I have, nevertheless, examined the best of them for the purpose of 
obtaining comparative data. I measured twenty-five male and twenty-five female 
Eskimo skulls. The majority of these came from Greenland, but two of the male 
and six of the female specimens are from the opposite coast of Labrador. I have 
found no distinct differences between the two local groups. The other series which 
I measured are: 

Ten male and seven female Indian skulls from different parts of America, all 
of which have common features while none shows artificial deformation; 

Eleven male and eleven female Negro skulls from different parts of Africa, all 
being clearly dolichocephalic; 

Thirteen male and nine female native Australian skulls from different parts 
of the continent; 

Twelve male and nine female Maori skulls from a single cave on the North 
Island of New Zealand. 

The Australian and Maori skulls form part of the material dealt with by 
K. Wagner in his great work (1937). I have only included skulls sufficiently com¬ 
plete to provide all the measurements. 

3. Sexual comparisons 

Table I gives all the means which I have calculated. We will first examine the 
question of differences between the sexes. In Tables IIa and b sex ratios expressing 
the female means as percentages of the male are given for all the characters and 
groups, and in the lower sections of the same tables the differences between the 



TABLE I 



fei 



<3$ CO l> t- OS r-H 

« « cq w « 



<o 

03 

CD 

03 

CO 

-«* 

I 1 i 

CO 

L"~ 

<o 

lO 

CO 

cO 

CM 

IO 

cb 

tr« 

cb 

lA- 

[--*• 

cb 

03 

03 

03 

03 

03 

03 

03 



fN QO \f< CO rt< OO QO 


OO 03 t*i tO t*I GO 03 
(M IM tM fN (M tM (M 


CO CO ^ CD CO CO CO 
03 03 03 03 03 03 03 


MNrHiOOON 
t' l> I> l> t'* l”' 1> 



'0 

CQ 

O M o <3> <N CD 05 
t>* tH C- (31 co L^- 
rH O -<H 03 1© O0 6 
^ ^ (N CO M CO ^ 

«Tt((NiOCit>'l> 

IQ t3l (M CO ^ CO ^ 

»H t^THaDQC^CO 

co cq co m co 


r—H r— 03 03 0> 03 to 



O co o oo cq co (M 

co it co in co i- w 

CQ 

oo co cb cb p-i cb cb 

IT* T*i T^ It o 03 CO 


M» M ffC « TH cc o 

« fO CO CC Tti CO cc 


Til CO 03 CO i—< 03 lO 

O M) and IM ^ 05 


r— 1 to 03 03 CO CO aO 

CO <34 OO CO O CO CO 

pi 

*—1 do O OO 03 o 03 

o t> 6 oo ti oo oc 


C<l r— 1 r— C r— 1 r-H cq r— < 

<M »-H r—« r— < eq r— -l r-~< 



95-00 14-18 95-00 17-76 127-33 46-16 








































































TABLE IIb 

Sex ratios (female meanjmale mean) and' mean differences for the indices 
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means for the two sexes, together with the standard errors of these quantities, are 
provided for the Norwegian, Lapp and Eskimo series. As will be seen from Table 
IIa, all the male means of absolute measurements are greater than the corre¬ 
sponding female means, except in the case of Sub. GB for the Negro series. Some 
of the differences for the three longest series are not statistically significant. This 
is so for Sub. IOW in the case of the Norwegian and Eskimo series, although the 
same difference for Lapps is markedly significant. This is an unexpected conclusion 
and no great reliance can be placed on it, particularly in view of the fact that 
K. E. Schreiner has found that in general sexual differences are smaller in Lapps 
than in Norwegians. The peculiarity noted is undoubtedly due to the mixed com¬ 
position of the Lapp material. Our Lapps do not form a homogeneous population, 
being mixed at different places in different degrees with Norwegians and Finns 
(Quains). The skulls were collected in various localities spread over a wide area. 
Several of the component local series are small and the sexes are unequally 
represented in them. I have calculated the sex ratios of tins character separately 
for all the local groups and have found fairly large differences. Some of the group 
means (Kautokeino, Karasjok and Kistrand) give low sex ratios, due, presumably, 
to the fact that the male means are too high to be characteristic of pure Lapps. 

The last column of Table IIa shows sexual comparisons of the zygomatic angle. 
In all series except the Negro the female mean is distinctly smaller than the male, 
although the difference is insignificant in the case of the Norwegian series. 

Sexual comparisons of the indices (Table IIb) are of greater interest. In accord¬ 
ance with the results of Woo & Morant, the frontal index of flatness (SFi) shows 
no significant differences, although it appears to have a slight tendency to be lower 
in the female sex. This is also true for the maxillary index of flatness ( SMi), whereas 
the zygotemporal subtense index (8Zi) shows a slight tendency to be greater in 
the female sex (cf. the relations of ZL), The means of the SS index indicate, in 
accordance with those of the SM index, that the maxillary region tends to be 
somewhat flatter in females than in males. The values of the different breadth 
indices show very small sex differences, but nevertheless they show fairly con¬ 
sistent relationships, In all groups the biorbital breadth is a little larger relative 
to the bimaxillary breadth in females than in males, The throe breadth measure¬ 
ments of the middle part of the facial skeleton (GB, ZB and FB) show a slight 
tendency to be relatively larger anteriorly than posteriorly in female compared 
with male skulls (cf. ZL). In my material this relation applies particularly to the 
Eskimos. It may be noted that for their larger series Woo & Morant found a dis¬ 
tinctly lower mean than I have for the bimaxillary breadth (GB) in female Eskimo 
skulls. The sex ratio of tiffs measurement in their material is only 92-7, as against 
95-2 for the biorbital breadth (IOW). 
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4. Racial comparisons 

Of the breadth measurements, the internal biorbital (IOW) shows the smallest 
differences between the series and the bizygotemporal (ZB) the largest. If the 
short Negro and Australian series are disregarded, the smallest means of all the 
breadth measurements are found to be the Norwegian. Woo & Morant give a 
slightly lower mean for the biorbital breadth, and a considerably lower mean 
(3-4 mm. less) for the bimaxillary breadth, for their nineteen Norwegian skulls 
than my values for the medieval Oslo series. The values found by these authors 
for male Anglo-Saxon and medieval English skulls are slightly higher for the 
biorbital breadth, and slightly lower for the bimaxillary breadth, than my 
Norwegian values. With regard to the two breadth measurements in question, 
there is little difference between the Oslo and Lapp series, particularly in the 
case of the male skulls. It may be noted that for the 140 male Lapp skulls 
K. E, Schreiner has given a mean bimaxillary breadth of 95-5 mm., which is 
0-8 mm. less than my value for male Lapp skulls and practically the same as my 
value for male Norwegian skulls. This author has also calculated for 121 female 
skulls a mean (91-7) which is slightly lower than mine. The difference between 
our values may be due to some extent to a difference in locating the zygomaxillary 
point. However that may be, the differences between my female means for 
Norwegian and Lapp skulls cannot be considered statistically significant in the 
case of either IOW (0-81 + 0-51) or QB (M3 ± 0-56). The bizygotemporal breadth 
(ZB), however, is significantly greater in the Lapp than in the Oslo skulls. The 
means of all the breadth measurements of the middle facial skeleton are clearly 
greater for Eskimos than for Lapps, and the biorbital breadth shows differences 
of the same sign, though they are much smaller. It is worthy of note that this 
last breadth is greater, or at all events not smaller, in the remarkably narrow¬ 
headed Eskimos than in the broad-headed Lapps. The bimaxillary breadth is 
probably greater in Eskimos than in any other human race, and among the known 
peoples of the earth only the American Indians appear to approach them in this 
respect. 

Just as the breadth measurements increase from Norwegian to Lapp and 
from Lapp to Eskimo type, so, to a still greater degree, do the subtenses to IOW 
and OB decrease. Consequently the indices derived from these measurements 
show marked differences. My means of Sub. IOW and of the frontal index of 
flatness for male Oslo skulls are higher than those found by Woo & Morant for 
their Norwegian series, and they accord better with the values given by these 
authors for Swedes. On the other hand, their means of the two measurements 
for Eskimos.are greater than mine. 

With their exceedingly low means for the index of maxillary flatness (SMi), 
the Eskimo differs greatly from the other groups examined, including the Indian 
which otherwise hear some resemblance to the Eskimo. In fact, I know of no 
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characters more capable of indicating the peculiarity of the Eskimo skull than 
this index taken together with the chief cranial indices. Nevertheless, the 
Eskimos are extreme in nearly all characters, and the question remains of the 
extent to which the index can be considered of value for racial classification in 
general. I can only contribute a little towards the solution of this problem by 
comparing my Norwegian and Lapp skulls. The means of the index of maxillary 
flatness reveal the following differences: $2-56, $3-43, with standard errors of 
0-40 and 0-38, respectively. The differences are of marked significance. 

The zygotemporal subtense [Sub. ZB) and its index ( SZi ) show entirely 
different relations. They are associated less with the “flatness” of the facial 
skeleton than with the antero-posterior lengths of the calvaria, and facial skeleton. 
Among the groups which I have examined the Lapp skulls of both sexes have the 
smallest means for the subtense. The Eskimo values are distinctly higher for the 
subtense, but owing to its considerable zygotemporal breadth the type has the 
lower index. The Norwegian skulls have distinctly higher means, both for the 
absolute measurement and for the index, than Lapps and Eskimos, but their 
means are exceeded by those for the prognathous Negro and Australian skulls. 

The index SSi, which gives Sub. QB as a percentage of Sub. ZB, shows the 
highest means for the Oslo skulls and, as a matter of course, very low means 
for the Eskimo skulls. 

Among the indices which relate the different breadth measurements to one 
another, the maxillo-orbital breadth index (GOi) would appear to be of value for 
racial classification, as it appears to differentiate families of races. Judging 
from the means given by Woo & Morant for the biorbital and bimaxillary breadths 
the index lies below 100—--that is to say, the former is greater than the latter 
breadth—in all European, southern Asiatic, the Australian and the majority of 
African populations, while it exceeds 100 in the case of eastern Asiatic, the maj ority 
of American, and a few African and Oceanic populations. As regards my material, 
the Lapps appear to deny their presumed Mongolian origin, since their values for 
this index do not exceed the Norwegian to a significant extent, while the Eskimos 
and Indians have means distinctly above 100. 

The ZO index rises considerably on passing from the Norwegian to the Lapp 
and then to the Eskimo series. As regards the other indices of breadth, I will 
merely refer to the low values for the ratio of the bimaxillary to the facial breadth 
(GFi) of Lapps, who are not only distinguished by their weak mandibles, but also 
by their weak maxillary bones in contrast to those of Eskimos. 

Finally, something must be said of the zygomatic angle. As the calculation of 
this is based on four different measurements, its value may be influenced by as 
many errors. Moreover, the two subtenses which are used do not lie in exactly 
the same plane. Even granting these defects, the angle, nevertheless, is fairly 
expressive as illustrating a feature which is not measured by the indices. I have 
previously dealt with the sexual differences, and will here confine myself to calling 
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attention to the difference between the means for Lapps and Eskimos. Both 
these types are characterized by broad and flat faces with what are called “high 
cheek hones The low values for Eskimos may appear to be unreliable, but doubts 
as to this will disappear on inspection of the skull of an Eskimo from the base. 
The unusual breadth of the Eskimo face is due partly to the great bimaxillary 
breadth, and partly to the considerable curvature of the malar bones, but the 
posterior parts of these bones are long and the zygomatic arches only protrude 
slightly in a lateral direction. The bimaxillary breadth of the Lapps, on the other 
hand, is distinctly smaller, the malar bones are rather short in a transverse 
direction and the arches protrude much more laterally, 


5. Variabilities 

In Tables IIIa and b will be found the standard deviations, while Tables 
IVa and b give the coefficients of variation, for the Norwegian, Lapp and Eskimo 
series. The bimaxillary (GB) is the most variable breadth and the bizygotemporal 
(ZB) is the least variable, while the bimaxillary is the most variable subtense- 
judging by coefficients of variation—and the bizygotemporal is the least variable. 
In the case of the subtense indices, also, the index of maxillary flatness (SMi) 
varies most, while the zygotemporal index varies least. These relations are 
undoubtedly due to the great variation in the form of the facial skeleton in the 
region of the zygomaxillary suture, which affects all the indices involving the 
bimaxillary breadth. As a test of sexual and racial differences in variability I am 
restricted to my Oslo and Lapp material. With regard to this question, I am 
bound to admit that, as long as we cannot count upon absolute accuracy in sexing 
skulls, very little emphasis can be laid upon any difference found. In my material 
male variability tends on the whole to be slightly greater than female, both as 
regards absolute measurements and indices, except in the case of Sub. IOW. 

Table V gives the average coefficients of variation in the two male and two 
female series for three breadth measurements (IOW, GB and ZB), for the three 
corresponding subtenses, for the three corresponding subtense indices (Nos. 
viii-x), and for the three corresponding breadth indices (Nos, xii-xiv). Averages 
are also given for all six absolute measurements, for all six indices and, finally, 
for all twelve characters. At the bottom of the table the corresponding averages 
are recorded for the male plus the female constants. It will be observed that most 
of the male averages slightly exceed the female. As regards racial differences, the 
average for all six absolute measurements is greater for Lapps, but the average 
for all six indices is greater for Norwegians. All the differences between the 
averages for the two series are, however, very small. 
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Standard deviations for the absolute measurements and ZL 
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TABLE IVa 

Coefficients of variation for the absolute measurements 
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TABLE V 

Average coefficients of variation for different sets of characters 


Series 

3 Breadths 

3 Sub¬ 
tenses 

3 Subt. 
indices 

3 Br, 
indices 

6 Absolute 
measure¬ 
ments 

6 Indices 

All 12 
characters 

Norwegian J 

4-05 


11-02 

4*68 

7*69 

7*85 

7*77 

0 

)) ‘I- 

3-89 


10-97 

4*11 

7*52 

7*54 


Lapp cl 

4'92 
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11-27 

4*20 

8*17 

7*74 

SI 

„ 9 

3*87 
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10*76 

3*98 

7*68 

7*37 
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Norwegian+Lapp J 

4-49 

11-38 

11*15 

4*44 

7*93 

7*70 

7*86 
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3*78 

11-41 

10*87 

4*05 

7*60 

7*45 

7*53 
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11*00 

4*40 

7*65 


7*65 

Lapp d+9 

4*30 

11-55 


4*09 

7*93 

7*56 

7*74 


(i. Conclusions 

The chief result yielded by my study is that measurements of the subtenses 
from the most posterior points on the margins of the pyriform aperture to certain 
other bilateral points would be a valuable addition to the routine technique 
followed in describing series of skulls. Of these measurements the maxillary 
index of facial flatness appears to be the most useful, but the maxiilo-orbital 
breadth index also gives comparisons of considerable interest, The adoption of 
these characters in craniological research will necessitate more precise definitions 
of the zygomaxillary and zy go temporal points than those used hitherto. Further¬ 
more, a new instrument should be designed for measuring projections from the 
nasolateral margins. 
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TRANSPOSITION OP THE VISCERA AND OTHER 
REVERSALS OP SYMMETRY IN MONOZYGOTIC 

TWINS 

By E. A. COCKAYNE, D.M., E.R.C.P. 

The twins, Eileen and Joan C., were first brought to my notice by Dr Reginald 
Lightwood, who saw them when they were six years old at the Hospital for 
Sick Children, Great Ormond Street, and found that one had dextrocardia and 
the other was normal. He remembered that they were very much alike and 
thought they were monozygotic. With some difficulty I got into touch with 
them, and Dr James Graham, Assistant County Medical Officer, Essex C.C., 
kindly examined the parents and the surviving brothers and sisters and found 
that in all of them the heart was in the normal position and no cardiac abnor¬ 
mality was present. The other children are: Eric, aged 21; Albert, aged 18; 
Doris, aged 16.; Gladys, aged 12; Betty, aged 11; John, aged 9; and Iris, aged 
5 years. One boy died in 1919, aged 3|-, of lymphatic leukaemia. The twins, 
born in August 1924, were 13 when examined. The parents are English and are 
not blood relations. No other case of transposition of the viscera is. known to 
have occurred in the family. 

The mother says the twins weighed 3 lb. at birth and that there were two 
afterbirths, i.e. each had a separate placenta. Clinically the heart is on the right 
in Eileen and on the left in Joan, and the size and sounds are normal in both. 
An electrocardiogram of Eileen taken by Dr J. L. Lovibond at the Middlesex 
Hospital shows inversion of all waves in lead 1, but that of Joan is normal (see 
pp. 290, 291). An X-ray of the chest and abdomen and a barium swallow showed 
dextrocardia with the stomach on the right and the liver on the left side in Eileen, 
and the normal position of the viscera in Joan. 

The twins are very much alike in appearance and, though it is possible to 
distinguish one from the other when together, there is very little doubt that they 
are monozygotic. In view of the rarity of such a mirror image condition a 
number of confirmatory observations were carried out. Eileen is right-handed, 
while Joan is left-handed. Their handwriting is very much alike, but is unformed, 
and it is difficult to say how far the resemblance is due to teaching. B, R, and r 
are formed in exactly the same way in both. 

Miss Ida Mann examined them and made the following report: 

Eileen . Visual acuity 6/6 right and left, Hypermetropia in the right eye and 
hypermetropic astigmatism in the left. In both eyes the error is slightly higher 
than in her twin’s. Retinal arterial pattern dissimilar in right and left eyes and 
does not resemble her twin’s; practically no mesodermal pigment, well marked 
lesser circle and no remains of pupillary membrane. The right eye is the master eye. 
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Joan. Visual .acuity 6/6 right and left. Low-grade hypermetropic astig¬ 
matism in both eyes. Axes oblique. Retinal arterial pattern dissimilar in right 
and left eyes and does not resemble her twin’s. Iris pattern the same in both eyes 
and exactly similar to her twin’s, practically no mesodermal pigment, well 
marked lesser circle and no remains of pupillary membrane. The left eye is the 
master eye. 

Miss Mann says that it is usual to find the retinal arterial pattern different 
in the two eyes of the same person and in corresponding or opposite eyes of mono¬ 
zygotic twins. According to Viggo Eskelund no two persons show an identical iris 
pattern, but a broad classification into types is possible and these types are 
determined genetically. Unfortunately the iris pattern in the twins is a very 
common one. 

Dr Phyllis Kerridge tested their hearing in a sound-proof room and both 
gave very similar graphs, bone conduction being normal and air conduction low 
normal. 

Dr G. M. Morant took the following measurements in millimetres; 


Eileen Joan 

Stature 1429 1437 

Left cubit 396 396 

Right cubit 397 396 

Maximum head length 180 184 

Maximum head breadth 136 136 

Head height (an uncertain measurement) 128 125 

Cephalic index 75-6 73-0 

Bizygomatic breadth 121 121 


R. L. It. L. 

Maximum length of oar 5Q 56 55 57 


Length of phalanges with fingers strongly flexed at metacarpophalangeal joints: 


2nd digit 

92 

92 

94 

93 

3rd digit 

99 

101 

103 

102 

4th. digit 

93 

95 

98 

98 

5th digit 

76 

76 

76 

78 


The girls are about 2| in. below the English average stature for their age and 
social class. The small differences between measurements suggest that they are 
monozygotic. The small difference between head lengths (leading to one between 
the cephalic indices) is probably the most significant. Joan has slightly longer 
digits than Eileen. As all the measurements are subject to errors of at least 
1 mm. it is safest to conclude that they do not provide any evidence of asym¬ 
metry. 

He also took photographs and reported as follows: The photographs are of 
three kinds. 

(I) Profiles. The outlines of the faces of the two girls are remarkably similar, 
their upper lips being unusually short. Eileen’s chin is rather longer and 
straighter than Joan’s. No difference between the ears (not all visible) was noted. 
Both have free lobes, They provide no. evidence of asymmetry. 

(II) Full-face. The photographs are not truly full-face, rather more of the 
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right than of the left being shown in each case. The outlines are very similar, 
Joan having a rather shorter and more pointed chin than Eileen. The teeth 
show a slight anomaly, which is the same in both girls. The central upper incisors 
are crossed, the left overlapping the right. 

A palmaris longus muscle is present in both twins on both sides. Hair colour 
in both twins matches scale 7. The hair whorl was clockwise in both twins. 

Miss David took fingerprints and made the following report: 

A glance at Waite’s tables shows that radial loops are nearly as common as 
ulnar loops on the forefinger. The remaining fingers are all ulnar loops, which are 
the commonest of all patterns. There is a distinct similarity between the counts 
of these ulnar loops and in the actual pattern. In both twins the actual counts 
approximate to the median value of the distance of ridges in loops. The thumbs 
are interesting and at first sight appear very similar, but there is a difference 
as shown in the counts, and the right thumb of Eileen is like the left thumb of 
Joan, while the left thumb of Eileen is like the right thumb of Joan. The most 
striking thing is that Eileen has a radial loop on the forefinger of the right hand, 
and Joan has a radial loop on that of her left hand, and Eileen has an ulnar loop 
on the forefinger of the left hand and Joan has an ulnar loop on that of the 
right hand. 



Right hand 

Left hand 


Joan 

Eileen 

Joan 

Eileen 

T 

23 C 9 

2106 

6 C 20 

11C 24 

1 

UL 13 

RL 5 

RL 5 

UL 13 

2 

UL 13 

UL 14 

UL 12 

UL 13 

3 

UL 12 

UL 12 

UL 12 

UL 13 

4 

UL 13 

UL 12 

UL 13 

UL 10 


UL=Ulnar loop. RL = radial loop. C=composite. 
The numbers are the number of ridges. 


The palmar and plantar patterns are similar, but those of Eileen’s left side are 
more like those of Joan’s right side than those of her own right side and vice versa. 

Dr G. L. Taylor finds that the blood groups in both twins are 0 M N. Both 
are able to taste phenyl-thio-carbamide. The other taste tests were unsatisfactory 
because they were unable to discriminate clearly between sour and bitter. 

At school the twins were in the same class and much alike in mentality, 
Questioned in different rooms with no opportunity of overhearing one another or 
comparing notes each said that her favourite colour was blue, one immediately 
and the other after some hesitation. Neither knew her sister’s favourite colour. 

I think the resemblance in general appearance and the agreement in so many 
independent characters, a number of which are known to be determined genetic- 
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Eileen. Electrocardiograms. 















































































































Joan. Electrocardiograms. 
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ally, proves that Joan and Eileen are monozygotic. They differ from the majority 
of such twins in the extent to which they are mirror images of one another: 

Eileen with complete transposition of the viscera. 

Joan with normal viscera. 

Eileen right-handed with the right eye the master eye. 

Joan left-handed with the left eye the master eye. 

Eileen with a radial loop with five ridges on the right forefinger. 

Joan with an radial loop with five ridges on the left forefinger. 

Eileen with an ulnar loop with thirteen ridges on the left forefinger. 

Joan with an ulnar loop with thirteen ridges on the right forefinger. 

Eileen with thumb ridges of the right hand closely resembling those of Joan’s 
left hand, and with thumb ridges of the left hand closely resembling those of 
Joan’s right hand. 

I can find records of only four similar cases in the literature. Baron (1825) 
showed one before the Section of Medicine of the Academy of Medicine, Paris, 
in December 1825. A short report says that the twin with transposed viscera 
was a girl, who died at the age of 8 days. Kuchenmeister states that the twins 
were monozygotic and that one had normal viscera and the other complete 
transposition. Miller (1893) records a case of monozygotic twins, one with situs 
inversus viscerum and'the other with situs solitus, but gives neither their age 
nor sex. Dubreuil-Chambardel (1927) records male twins, aged 25, identical in 
general appearance, in weight, height, and in physical characters. Each had 
varices on the lower limbs of the same type and developing in the same way. 
One had a harelip on the left, the other on the right, and one had complete 
transposition of the viscera, while the other was normal in this respect. The 
visceral condition was confirmed by radiological examination. I have been unable 
to refer to the original account of the twins recorded by Araki (1934), but Taku 
Komai, though he gives neither their age nor sex, says they were monozygotic 
and that one had normal viscera and the other complete transposition. Taku 
Komai says that this is the fifth case recorded. Possibly he includes that re¬ 
corded by Tamm and later by Betschler. This refers to female twins, stillborn at 
the seventh month, one normal, the other oedematous and hydrocephalic, with 
the heart and aorta on the right side, no lung on the right and a small one on the 
left. The other viscera were correctly placed, but the liver was small and the 
kidneys large. There was one placenta with two cords implanted centrally with 
the umbilical veins anastomosing. 

Newman in his Biology of Twins shows that the armadillo, Dasypus novem- 
cinclus, normally produces monozygotic quadruplets. He makes the following 
observations. At first there are two embryos and later a secondary embryo is 
formed to the left of each primary embryo. Asymmetry of the scutes is not 
uncommon and in the quadruplets all grades of mirror image formation are 
found. There may be mirror-imaging between individuals of opposite pairs 
(primary embryos), but this is much less common than imaging between twin 
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partners derived from one half of the egg (a primary embryo and the secondary 
embryo to its left). In general mirror-imaging between “opposites” is evidence 
of a residuum of a primary bilateral symmetry that held sway in the blastocyst 
before polyembryonic fission began. When the primary outgrowths are formed, 
they are the product of the antimeric halves of the first embryo and should 
therefore show mirror image relations. But a partial physiological isolation of 
the two halves permits a certain reorganization or regulation of new symmetry 
relations, which tends more or less completely to destroy the original symmetry, 
yet often leaving a trace of the latter. Finally when each secondary outgrowth 
organizes its own bilateral symmetry, it tends to lose, partially at least, the 
earlier symmetry relations, and to establish its own mirror-imaging of right and 
left halves. In some cases traces of all three symmetry systems appear in a single 
set of foetuses, but it is common to find only two systems interacting. Newman 
says that no case of visceral reversal was found in spite of a careful search 
through a large number of foetuses. This, he thinks, is due to the fact that 
twinning is initiated and carried out in the ectoderm, and that the endoderm 
becomes involved only passively and considerably later. 

In the case of human monozygotic twins there may be almost complete 
mirror image arrangement of palmar patterns so that the left hand of x corre¬ 
sponds with the right hand of y and vice versa. This corresponds with the rather 
rare mirror-imaging of “opposites” seen in armadillo quadruplets and in 
Newman’s opinion goes far to prove that polyembryony actually occurs in man. 
A commoner manifestation in man.is reversal of symmetry in the pattern of an 
index finger of one twin so that it mirrors the condition in the corresponding 
index finger of the other twin. This reversal of pattern occurs in both hands of 
Eileen and Joan. 

In addition to the ridges on the skin other epidermal structures may show a 
mirror image arrangement, such as naevi and angiomata, and Siemens (1924) 
records an accessory nipple below the normal one, which was on the right side 
in one and on the left side in the other twin. 

Newman believes that monozygotic human twins become physiologically 
isolated at a considerably earlier period than do armadillo quadruplets and 
founds his belief on the fact that there is so little mirror-imaging in the former 
and so much in the latter. He says that as a general rule the earlier the separa¬ 
tion the more complete is the reorganization of the symmetry relations in the 
separate individuals and the less.residuum of the original common symmetry. 

In double monsters of morfozygotic origin there is, according to Dubreuil- 
Chambardel (1927), frequently a hare-lip, which is axial in both or lateral in 
both, and as a rule it is of the same degree, and the hare-lip of one component is 
always a mirror image of the hare-lip of the other. He points out as many others 
have done how often one component of a double monster has complete or partial 
transposition of the viscera, the arrangement being that the apices of the two 
hearts point away from the place where the twins are attached to one another. 
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There can be no doubt that neither the hare-lip nor the transposition of viscera 
in double monsters is of genetic origin. Hare-lip in man is sometimes determined 
genetically, but the incidence in double monsters is far too high for this to be 
true in their case. Newman says that in the case of double monsters fission must 
begin much later than usual and is never completed. While in the armadillo 
polyembryony is the rule and fission is standardized, in man it is the exception 
and there is considerable variability in the period at which fission takes place. 
Transposition of the viscera in man is recessive to the normal arrangement, and 
elsewhere I have suggested (1938)..two ways in which it could affect only one 
member of a pair of monozygotic twins. Apart from the improbability of so un¬ 
usual an occurrence happening in at least five recorded oases, neither explanation 
accounts for the occurrence of other mirror image arrangements, and any genetic 
explanation must be abandoned. If Newman’s views about mirror image forma¬ 
tion are accepted, the very rare cases of human monozygotic twins with a 
mirror image arrangement of the viscera in addition to that of ectodermal 
structures are due to an unusually late fission. The fact that they are so rare 
seems to show that unless fission is very late each twin reorganizes its own 
symmetry and, if genetically normal, the viscera are normal in both, but if the 
zygote is a homozygous recessive for transposition of the viscera, the viscera are 
transposed in both. Apparently few twins in which late fission occurs succeed in 
becoming completely separate, for double monsters with their own mirror image 
symmetry, i.e. with the viscera transposed in one and normal in the other, are 
far commoner than separate twins with this arrangement. The separate twins, 
which do show this arrangement have in fact just escaped being double monsters. 

My thanks are due to Prof. R. A. Fisher for kindly putting the resources of the 
Galton Laboratory at my service and to Dr Julia Bell for her help and advice, 
and to Miss David, Dr Taylor and Dr Morant. I wish also to express my gratitude 
to Miss Ida Mann and to Dr Lovibond. Dr Graham had intended to write a 
report on the twins, and I should like to thank him for allowing me to do so 
instead and for examining the other members of the family. 
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THE HUMAN REMAINS OF THE IRON AGE AND OTHER 
PERIODS FROM MAIDEN CASTLE, DORSET 

By Cl. N, GOODMAN and G. M. MORANT 


1. Introduction 

Excavations at Maiden Castle, the largest fortified site of prehistoric date in 
England, were conducted under the direction of Dr R. E. M. Wheeler during 
four consecutive seasons from 1934 to 1937. The discoveries made were of unusual 
archaeological interest, and they include a number of skeletal remains which 
were preserved with unusual care. The bones available represent 104 individuals 
ranging in age from a foetal to a senile stage of development. This paper is a 
report on the eighty-three skeletons which are sufficiently well preserved to be 
of some anthropological interest, and the periods, sexes and age groups to which 
they are assigned are given in Table I. An unpublished report on the excavations 
by Dr Wheeler contains particulars of each of the 104 individuals, and a section 
by the present writers in the same work gives a summary of the general con¬ 
clusions reached below and fuller descriptions of the mutilated specimens. 


TABLE I 

The periods, sexes and ages of individuals from Maiden Castle whose 
skeletons were measured 



* Including one individual (T 28) classed as Iron Age 0 or Romano-Britiah., 


The metrical study of the skeletons made hitherto has been confined to the 
crania and mandibles, on which all the customary measurements have been 
taken, and to the clavicles and long bones of the arms and legs, described by 
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lengths only. Individual measurements and remarks for the crania and long 
hones are given in the appended Tables VII and VIII; remarks on the mandibles 
are given with those on the crania, but the individual measurements of these 
bones are not provided, Contours of the crania have been drawn, but they are 
not used here, as additional English material of the same periods would be 
needed to furnisli sufficiently reliable type figures. The writers hope to undertake 
a more detailed examination of the long bones and an examination of the other 
bones of the skeleton. 

TABLE II 


Maximum lengths of the shafts of long bones of infants of Iron Age 
date and of modern foetuses 


No. 

Year excavated ... 
Period* 

C1M1 

1936 

A 

phi 

1937 

A 

QII 

1937 

A 

R1 
1937 
A 

pi 

1937 

B 

RII 

1937 

B 

T19 
1937 
B 

0 

1936 

Of 

P 36 
1037 

Gt 

Modern§ 

Modern|| 

Femora 

It. 

_ 

75-1 


78-9 






664 

58b 


L. 

— 

75-1 

— 

— 

86-2 

76-9 

724 

81-5 

— 


58b 

Tibiae 

It. 

— 

64-0 

— 

68-2 

-, 

67-0 




664 

5.1b 


L. 

— 

64-2 

— 

68-8 

— 

674 

68-8 

— 

724 

56-5 

514 

Fibulae 

It. 

— 

6H) 

— 

_ 

_ 

—- 

61-2 

68-0 

__ 

__ 

48-2 


L. 

— 

61-2 

— 

— 

— 

— 

61-2 

— 

— 


48-2 

Humeri 

R. 

70-5 

64-1 

— 

GOB 

744) 

674) 

EH 

714 


584 

534 


L. 

70-7 

64-8 

75-5 


— 

— 

— 

714 

73b 

68-2 

53b 

Radii 

R. 

_ 

49b 

— 


58-2 

524 

51'8 

55'5 

_ 


44b 


L. 

— 

494 

— 


58-0 

— 

5P2 

55-7 


45b 

44b 

Ulnae 

It. 

644 

1>8'2 

— 


66-2 

614 

— 

-- 

, - 


49b 


L. 

— 

58*0 

— 

614 

— 


68-8 

64-5 

— 

53b 

49b 

Clavicles 

it. 

-- 

46-0 

-- 

■ 

48-3 


424 

494 

464 


34-5 


L. 

47 0 

46-0 

— 

H 

484 

— 

42-6 

— 

— 

38'8 

34b 


* A, B and C refer to the chronological divisions of the. Iron Ago. j- Belgic. 
X Bolgic War Cemetery. § Full term. || 7 months foetus. 


Nearly all the skeletons are incomplete, but in the majority of cases the sexes 
of the adult and adolescent individuals can be judged, principally from the 
pelvis, with a reasonable assurance of accuracy. It was suspected at first that 
nine skeletons, which are remarkably well preserved in view of their tender age, 
were foetal. The maximum lengths of the shafts of the long bones for these—the 
measurements having been taken in any direction by using the flat arms of a 
pair of small calipers—are given in Table II. The last two columns on the right 
of the table give the same lengths for a child at birth and a seven months foetus. 
These two specimens are preserved in the museum of-the Department of 
Anatomy, University College, London, and we are indebted to Dr Matthew 
Young for giving us access to them. All nine of the skeletons from Maiden Castle 
are appreciably larger than the two of modern date, and hence it must be 
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supposed that the former are of young infants, probably all from three to six 
months old. No other measurements of them were taken. 

The remaining seventy-four skeletons are available for racial comparisons, 
and the majority of these are of Iron Age date. Thirty represent the population 
of the site at the very end of the period, as they came from a cemetery used for 
the Belgic defenders of the Castle who were slain by Roman invaders in the year 
a.d. 43. Nearly half of the skulls in this series bear witness to the event in the 
form of sword-cuts and other mutilations. The Belgic War Cemetery series is 
just long enough to make statistical comparisons between it and other groups 
worth while, but all the other groups distinguished in Table I are clearly too 
small to stand alone. In some comparisons below a composite series made up by 
pooling all the other individuals of Iron Age or Romano-British date is con¬ 
sidered, either apart from the Belgic War Cemetery series or combined with it. 
This procedure is of provisional value only, of course. There may have been 
changes in the racial composition of the population of Maiden Castle during the 
Iron Age and again in Roman times, and the evidence available is quite in¬ 
sufficient to show whether this was so or not. 

It should be appreciated, too, that it is not unlikely that a small community, 
which may have been peculiar owing to relative inbreeding, may have persisted 
there for several generations, and the characteristics of such a local group may 
mislead if it is taken to typify the large racial population of which it formed a 
special part. The new material from Maiden Castle provides a very welcome 
addition to our meagre knowledge of the physical characters of the inhabitants 
of England in Iron Age times. It may be hoped that it will form a nucleus to 
which other specimens of the period—both those already housed in museums 
and others as yet undiscovered—may be added, until the evidence is abundant 
enough to satisfy the most exacting anthropologist. The object of this paper is to 
provide a descriptive record of the skeletons available with such a hope in view, 
and any results regarding racial relationships made in it are intended to be of 
a tentative nature only. 


2. Non-METRICAL FEATURES OF THE MAIDEN CASTLE SERIES 

The remarks on the Iron Age and Romano-British series given in this section 
relate almost entirely to the skulls. Comments on a few of the long bones which 
exhibit gross abnormalities are given in Table VIII, and the remaining parts of 
the skeletons have not been examined. The Neolithic skeleton of a young man 
(Q 1) was extensively mutilated, and otherwise it is not remarkable. The skull¬ 
cap of the same period is also that of a young man. The Saxon skeleton (Q) is 
male but the age at death cannot be estimated as the skull is missing. The 
following remarks refer to the remaining specimens, which are all of Iron Age 
or Romano-British date. 
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Estimates of the ages at death of the adults were obtained by noting the 
conditions of the coronal, sagittal and lambdoid sutures, and the ectocranial 
aspects of these give the following frequencies: 



All sutures 
open 

Sutures begin¬ 
ning to close or 
partly closed 

All sutures 
closed 

Totals 

■s 

Eelgic War Cemetery aeries 

5 (28%) 

12 (67%) 

M6%) 

18 


Others 

3(21%) 

9(64%) 

2(14%) 

14 


Total series 

8 (25%) 

21 (66%) 

3 (9%) 

32 

s 

Eelgic War Cemetery series 

3(43%) 

4(57%) 

_ 

7 


Others 

7(37%) 

10 (63%) 

2(H%) 

19 


Total series 

10(38%) 

14 (54%) 

2(8%) 

26 


It is probable that all the people buried in the Belgic War Cemetery were 
massacred. The age constitutions of the short adult series are very similar, 
however, to those of the series made up by other people interred at Maiden 
Castle, who probably died from natural causes. The massacre must have been 
carried out without regard to age, though there appears to have been some sex 
discrimination, since twice as many men as women were excavated from the 
cemetery (see Table I). The percentage frequencies for the total series show the 
usual sex differences, due to the fact that the sutures tend to close at an earlier 
age in males than in females. Comparison with values obtained in the same way 
for other series (see Risdon, 1939, p. 107) shows that the Iron Age men died at 
a rather younger age, on the average, than those interred in English cemeteries 
of a later date, but that the women are not distinguished in the same way. In 
considering the frequencies of anomalies, the total series from Maiden Castle is 
referred to below. 

The sagittal suture was normally the first to close, followed by the coronal 
and then the lambdoid. Only one specimen (P 23, male) is definitely anomalous 
with regard to the condition of the principal calvarial sutures. Its sagittal 
suture is obliterated, the coronal is beginning to close and the. lambdoid is open. 
This specimen shows no apparent sign of distortion and its cephalic index (77-4) 
is above the average, so it may be assumed that the sagittal suture was not 
obliterated before maturity was reached, Of thirty-three male frontal bones 
only one (P ii, an isolated bone for which no measurements are given) is metopic, 
and there is only one female metopic specimen out of a total of twenty-seven 
frontal bones. Among Western European crania metopic specimens are usually 
found with a frequency of about 10 %, but the Iron Age series are so short that 
no significance can be attached to the fact that the frequencies for them are 
exceptionally low. There are no examples of interparietal bones, or of an os 
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epaclal, among the thirty-two male and twenty-six female occipital bones. A 
much rarer anomaly is exhibited by a male cranium (P 7 A). This is a trace of 
a suture (33 mm. long) extending across the posterior part of the right parietal 
bone: its position can be seen from Plate IIB. One male skull (P 27) shows the 
ocoipito-mastoid suture obliterated prematurely on both sides. There is one case 
of fronto-temporal articulation on both sides among the males, and one of the 
pterion in 11 on both sides among the females. One female specimen has both 
malar bones divided by horizontal sutures. A male skull (0 3, Plate IIIB) has 
the basi-occipital broken exposing the extension into it of the sphenoidal air 
sinuses, the condition exhibited by the Swanscombe skull. There is one large 
cavity and the extremity of a small one to the right of it. 

Wounds that had healed completely during life were noted on the cranial 
vaults of three of the male and two of the female skulls. None of these was 
severe and a female specimen (T 22) has a depression on the right temple which 
was more serious than the other injuries. An adolescent female skull (P 20, 
Plate II A) has the right malar bone deformed, probably as the result of a wound. 
By far the most serious injuries of traumatic origin are shown by the long bones 
of one of the males (T 5). His right elbow, was shattered, involving complete 
separation of the proximal extremity of the ulna, his left radius was fractured 
near the wrist, and his fractured left fibula became fused to the tibia. Healed 
fractures of the long bones were only found in the case of two other skeletons, 
viz. P 23 (male, right ulna) and P 36 (female, left fibula). Pathological conditions 
of bones not of traumatic or dental origin were only noted in the case of one male 
skull (T 9) which has a swelling on the right maxilla below the orbit which was 
probably caused by a tumour, of a skeleton of the same sex (P 25) which has 
osseous deposits on the muscular ridges of the left femur and fibula, and of a 
female (T 12) which has the left humerus markedly deformed, probably as the 
result of an inflammatory condition of the surrounding soft tissues. We are 
indebted to Dr A. M. El Batrawi for help in interpreting the conditions of these 
and other specimens in the series. 

The teeth of the individuals interred at Maiden Castle are by no means well 
preserved. The following.frequencies are found for the total “Iron Age” series: 



Upper jaw 

Lower.jaw 


£ 

¥ 

£ ■ 

¥ 

No teeth lost before death 

12 

19 

22 

13 

One or more teeth lost before death 

16 

Kfl 

10 

12 


In the case of excavated series of skulls which are not of recent date, it is 
customary to find for either sex that more than half the specimens had lost no 
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teeth from the upper jaw before death. Abscess cavities were found in several 
of the jaws. A female specimen (T 21, Plate III A) has two symmetrically dis¬ 
posed cyst cavities in the buccal surfaces of the maxillae at the roots of the 
second premolars. There are examples of crowded front teeth. A male mandible 
(P 23) has a milk molar retained in the position of the left second premolar, 
which had not appeared, and a female bone (P 26) appears to have had only three 
incisors erupted. The upper jaw of a female skull (T 28, Plate HID) shows the 
canines missing and the first premolars rotated. A female mandible (P 14, 
Plate IIIE) has the left third molar impacted but almost fully erupted: there are 
also distinct swellings on the inner alveolar margin extending from the first 
premolar to the first molar on the left side, and from the second premolar to the 
first molar on the right. Several of the Sinanthropus mandibles exhibit this 
condition (torus mandibular is), and it occurs more frequently, and to a more 
marked extent, in some modem races of man (particularly in Eskimos), but is 
seldom found in European series. Dental anomalies appear to be exceptionally 
frequent in the short series of skulls from Maiden Castle. 

) 

but nearly all might be considered quite unexceptional if found in any British 
collection of later date than the Bronze Age, There are few examples, however, 
of the markedly retreating frontal bones which characterize the seventeenth- 
century London skulls, particularly those from Earringdon Street, The variation 
exhibited appears to be no greater than that expected for a community of inter¬ 
marrying people, except for the fact that one female skull from, the Belgic War 
Cemetery (P 36, Plate IIC, D) stands apart from the others on account of its 
aberrant form. The facial skeleton has an unusual premaxillary height, though 
it is not prognathous; the nasal index is high but not extreme, and the nasal 
bridge is broad and depressed, The cephalic index of this specimen (87-0) is 
easily the highest for the series, and the individual was decidedly short 
(1451 mm. = 4 ft. 9J in.), though taller than two other.women interred in the 
Belgic War Cemetery. It is possible that the skeleton P 36 is that of an alien 
in Western Europe, but it appears to be more probable that its peculiarities are 
of individual rather than racial significance. Its measurements were included in 
computing averages. 

3. Metrical comparisons oe the Iron Age and Romano- 
British crania 

Excluding the two Neolithic specimens, there are 30 adult male and 26 adult 
female crania sufficiently complete to give measurements, though most of these 
are defective to some extent. Individual readings are in the appended Table VII. 
In spite of the small numbers, it appeared worth while computing separate means 
of the more important characters for (a) the Belgic War Cemetery series, and 
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(b) all the other specimens of Iron Age or Roman date. The latter is a miscel¬ 
laneous collection made up by individuals belonging to the following groups: 



Male 



Male 

Female 

Iron Age A 

1 


Romano-British 

3 

3 

„ B 

1 

7 

Iron Age or Romano-British 

2 

1 

„ o 

0 

8 




The pooling of this material, and treatment of it as if all the specimens 
represented a single racially homogeneous population, is obviously a pis aller. 
The means found for the composite group (Table III) are actually very close to 
those found for the Belgic War Cemetery series, which is almost certainly made 
up by individuals who belonged to a single community. On the supposition that 
the variabilities of the groups were of the usual order, all the differences between 
the two sets of means are quite insignificant. Both types have surprisingly large 
basio-bregmatic heights, and hence unusual indices (100 H'jL and 100 B/H') 
involving this diameter. Otherwise, they appear to have no features which can 
be considered at all peculiar in English series of post-Bronze Age date. 

This comparison provides some justification for combining the two sub¬ 
groups, to give a single series which may be supposed to represent the population 
of Maiden Castle from Iron Age A to Roman times, though it is clear that the 
evidence available is quite inadequate to demonstrate that the racial constitution 
of the population remained stable throughout the period. The following distri¬ 
butions are found for the total sample.' 


Cephalic index 
(central values) 

67 

69 

71 

73 

7S 

77 

79 

81 

83 

86 

87 

Totals 


d 

1 


2 

5 

6 

7 

1 

m 


1 

■ 

23 

■ 

¥ 

— 

| 

2 

4 

9 

4 

1 

1 


1 


22 

■ 

Height-length 


■ 





i 

■ 

m 

77-5 

78-5 

79*5 

Totals 

index 

88'6 

mil 

siiaa 

71*6 

72-6 

73-6 

74'5 

76-o 

76-6 

(central values) 









| 

1 

i 

■ 


<? 


3 

i 

4 


n 


H 

1 

|g 

M 


21 

? 

2 

i 

i 

I 


El 


l 

■ 



H 

22 


The highest cephalic index (87-0) is for a female specimen (P 36), in the 
Belgic War Cemetery series, which is also remarkable on account of the fact that 
its facial skeleton is of an unusual form. The highest height-length index is for a 

Biometrika xxxi 20 



















302 


Human remains of the Iron Age and other periods 


TABLE III 

Mean measurements of series of Iron Age and Romano-British 
crania from Maiden Castle 



Male, 

Female 

Belgic War 
Cemetery 

Others 

Total 

Bel^ie War 
Cemetery 

Others 

Total 

L 

187-7 (14) 

189-9 (9) 

188-6 (23) 

179-0(7) 


180-1 (26) 

£ 

141-4 (14) 

140-0 (10) 

140-8 (24) 

135-7 (6) 

135-8 (16) 

135-8 (22) 

W 

137-1 (14) 

130-9 (8) 

137-1 (22) 

134-4 (7) 

131-7 (15) 

132-6 (22) 

B 1 

97-2 (14) 

96-9 (12) 

97-1 (26) 

92-5 (7) 

94-1 (19) 

93-7 (26) 

8 

384-5 (11) 

381-9 (7) 

383-5 (18) 

371-3 (6) 

365-6 (10) 

367-7 (16) 

U 

528-6 (12) 

530-9 (8) 

529-5 (20) 


508-1 (15) 

507-0 (21) 

fml 

37-8 (13) 

37-3 (8) 

37-6 (21) 

35-9 (5) 

35-3 (11) 

35-5 (16) 

Jinb 

31-6 (12) 

31-2 (8) 

31-5 (20) 

28-2 (5) 


28-6 (15) 

LB 

102-5 (13) 

102-2 (6) 

102-4 (19) 

96-7 (7) 

99-9 (12) 

98-7 (19) 

G'H 

73-3 (8) 

71-3 (6) 

72-4 (14) 

68-1 (0) 

67-2 (9) 

67-6 (16) 

Nil, L 

52-4 (9) 

51-2 (7) 

51-9 (10) 

46-7 (6) 


48-3 (18) 

NB 

25-0 (8) 

25-1 (8) 

25-0 (16) 

22-4 (5) 

24-3 (8) 

23-6 (13) 

O x L 

43-6 (9) 

41-2 (6) 

42-6 (15) 

41-3 (5) 

41-3 (9) 

41-3 (14) 

0,L 

33-2 (9) 

32-3 (6) 

32-9 (15) 

32-5 (5) 

31-9 (9) 

32-1 (14) 

100 B/L 

75-4 (14) 

73-5 (9) 

74-6 (23) 

76-6 (6) 

75-5 (16) 

76-8 (22) 

100 H'/L 

73-4 (13) 

72-4 (8) 

73-0 (21) 

75-2 (7) 

72-9 (15) 

73-6 (22) 

100 B/F 


102-7 (8) 

103-1 (21) 

■MKDl 

102-5 (13) 

102-2 (19) 

100 fmblfml 

83-5 (12) 

83-7 (8) 

83-6 (20) 

79-2 (5) 

82-9 (9) 

81-5 (14) 


48-6 (7) 

50-2 (6) 

49-4 (13) 

48-1 (5) 

50-0 (8) 



76-3 (9) 

78-5 (6) 

77-2(15) 

78-8 (5) 

77-2 (9) 

77-8 (14) 

FL 

8fi°-7 (8) 

80°-O (4) 

86°-5 (12) 


83°9 (9) 

85°-2 (12) 


Total series 


■ 


Female 

B" 

122-4 (17) 

114-7 (16) 

JimL B 

114-3 (19) 

107-2 (14) 

<il 

93-4 (14) 

90-3 (11) 

(IB 

95-2 (18) 

89-4 (14) 

J 

133-6 (11) 

123-2 (11) 

<V 

46-6 (19) 

44-0 (12) 

O* 

40-7 (16) 

39-0(11) 

SB 

4-7 (16) 




Male 

Female 

m 

316-7 (20) 

mm 


114-2 (24) 


S' 

116-5 (22) 

112-2 (21) 

S,' 

99-9 (22) 

97-0 (17) 


132-0 (22) 

126-6 (22) 

S* 


125-3 (21) 

s* 

122-3 (22) 

116-0 (17) 

c* 

1514-7 (19) 

1380-7 (19) 



Male 

Female 


10-1 (16) 
58-7 (22) 
76-3 (13) 
88-7 (12) 
47-6 (16) 
61°-0 (14) 
76°-3 (14) 
42°-6 (14) 

9-3 (15) 
60-5 (17) 
73-5 (11) 
87-4 (7) 
47-6 (14) 
63°-2 (11) 
75°-2 (11) 
41°-6 (11) 


* Reconstructed from L, R.and H'. 


male cranium (P 7 A) in the same series. In spite of these outlying cases, the 
distributions for the two characters in question, and those for the other cha¬ 
racters, provide no clear evidence of racial heterogeneity, though it cannot be 
inferred from them that the total population considered was racially homo¬ 
geneous. 
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If the supposition that all the skulls of Iron Age and Romano-British date 
from Maiden Castle represent a single population be accepted, the series available 
is still not long enough to give reliable racial comparisons of a statistical kind. 
A short series is liable to indicate an absence of differentiation from other series 
when one of an adequate length from the same population will provide evidence 
of distinction, and it will also be liable to give a misleading estimate of diver¬ 
gence from other types when distinction is clearly indicated. Large enough 
samples must be demanded, in particular, when making comparisons between 
closely related populations, such as the group made up by all the prevailing 
populations of England from the end of the Bronze Age to modern times. In 
spite of its restricted size, coefficients of racial likeness were computed for 
male means between the total “Iron Age” series from Maiden Castle and four 
others, viz,; 

(a) The Anglo-Saxon made up principally by skulls preserved in London 
Museums (Morant, 1926). 

( b ) The so-called British Iron Age (Morant, 1926) made up by skulls from 
the south of Scotland and various parts of England. Several of the specimens 
are known to be of Romano-Britons, and some of the others are not dated 
satisfactorily, so the series is of little value. 

(c) The seventeenth-century series from a plague in Whitechapel (Macdonell, 
1904), the revised means given by Hooke (1926) being used. 

(d) The seventeenth-century series from the cemetery in Earringdon Street 
(Hooke, 1926). 

The following coefficients of racial likeness are found for the total “Iron 
Age ” series from Maiden Castle and these four, the numbers in brackets following 
the crude values being the numbers of characters on which they are based, and 
the numbers in brackets following the names of the series being the average 
numbers of skulls on which the means used are based (n’s )*: 


Maiden Castle (17'7) and Anglo-Saxon (341) 

„ (18-5) and British Iron Age (664) 

„ (17-7) and Whitechapel (90-7) 

„ (17-8) and Earringdon Street (96-7) 


Crude C.R.L. 
—0-28 ±0-18 (29) 
2-98 ±0-21 (20) 
2 79 ±0-17 (30) 
429 ±0-17 (30) 


Reduced C.R.L. 


10-73 ±0-77 
943 + 0-69 
1427 ±0-68 


As far as can be told from the scanty evidence, the Maiden Castle and Anglo- 
Saxon samples might represent different sections of the same population, while 
the former series is clearly differentiated from the other three. Previous 
comparisons have shown that the British Iron Age and the two seventeenth- 
century London types are very similar, while the Anglo-Saxon stands apart from 
that cluster. These relationships suggested that the Londoners were descended 
primarily from the pre-Saxon rather than from the Anglo-Saxon population, but 
the new evidence does not support this view. 

i 

* The standard deviations of the Farringdon Street series were used in computing the coefficients. 


20-2 
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For characters considered singly, there are very few significant differences 
between the means for the short Maiden Castle series and those for the three 
later series. All the differences from the Anglo-Saxon values are quite in¬ 
significant, the highest of the 29 a’s being 3-8. The only a’s greater than 10 are 
for the nasal angle (N/_, 15-0) and basio-bregmatic height (//', 11 ■ 8) in comparisons 
with the British Iron Age means, for IV (39-7), 100 IV/I (32-7) and 100 B[H' 
(28-7) in comparisons with the Farringdon Street means, and for TV (18-9), 
100 IVIL (16-3), NL (15-2) and 100 BjlV (10-5) in comparisons with the White¬ 
chapel means. The Maiden Castle skulls are markedly orthognathous judging 
from the nasal angle, but this measurement is only available for 14 male skulls 
and little stress can be laid on the peculiarity, Otherwise the type is only 
distinguished by a calvarial height which is large both absolutely and also 
relative to the length and breadth of the brain-box. It is known that the 
Anglo-Saxon is distinguished in precisely the same way from the other types 
considered, 

Table IV gives male means of the three principal diameters and the three 
indices derived from them for the series referred to above and the following: 

(e) Three series of Romano-British skulls described by Buxton (1935), the 
six measurements in question being the only ones available for these. 

(/) An Anglo-Saxon series from Burwell, Cambridgeshire (Brash et at. 1935). 

(g) A third seventeenth-century London series from a burial-pit at Moorfields 
(Macdonell, 1906). 

(A) A modern series of Lowland Scottish skulls measured by Turner (1903) 
and compiled in the way described by Hooke (1926, pp. 22 and 38). 

(i) A modern series from Glasgow (Young, 1931). 

All the series included in the table are believed to represent populations 
which prevailed in England and the south of Scotland at certain periods from 
the beginning of the Iron Age to modern times, and the minority populations for 
which there is craniological evidence are omitted. The length, breadth and 
cephalic index are seen to be remarkably constant throughout, while the basio- 
bregmatic height and the two indices involving this diameter show larger 
differences. The greatest heights, highest height-length and lowest breadth- 
height indices are for the two series from Maiden Castle, and they thus show a 
remarkable concordance in spite of their very restricted sizes. In these respects 
one of the Romano-British series (the Brigantes) and the two Anglo-Saxon come 
next, and the others follow, with one other Romano-British (the Dobuni) and 
two of the seventeenth-oentury London series at the lower end of the scale. 
Little significance can be attached to this order, however, as some of the series 
are short. There are no statistically significant differences, for example, between 
the means for the total series from Maiden Castle and those for the Romano- 
British Belgae. The populations represented in Table IV are all remarkably 
similar in type, and they must have been closely inter-related. Larger series 
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TABLE IV 

Mean measurements of British series of male skulls 


Iron Age and 
Romano- 
British series 


“British 
Iron Age” 

Romano-British 

Belgio War 
Cemetery 

Others 

Total 

Belgae 

Dobuni 

Brigantes 


187-7 (14) 
141-4 (14) 
137-1 (14) 
75-4 (14) 
73-4 (13) 
103-3 (13) 

189-9 (9) 
140-0 (10) 
136-9 (8) 
73-5 (9) 
72-4 (8) 
102-7 (8) 

188-6 (23) 
140-8 (24) 
137-1 (22) 
74-6 (23) 
73-0 (21) 
103-1 (21) 

187-4 (61) 
141-4 (102) 
132-9 (77) 
76-4 (61) 
70-9 (61) 
106-3 (77) 

189-6 (40) 
141-0 (41) 
134-2 (33) 
74-4(40) 
71-0 (33) 
106-6 (33) 

190-8 (85) 
144-2 (77) 
132-6 (46) 
76-6 (71) 
69-4 (43) 
109-9 (41) 

189-9 (67) 
141-7 (67) 
136-8 (38) 
76-7 (61) 
71-4 (34) 
104-4 (33) 


Other 

series 

Anglo-Saxon 

Seventeenth-century London 

Modern Scottish 

• London 
museums 

Burwell 

Farringdon 

Street 

White¬ 

chapel 

Moorfields 

Lowland 

Glasgow 

L 

B 

H' 

100 B/L 
100 H'/L 
100 BjE' 

190-6 (58) 
141-7 (103) 
136-0 (31) 
74-7 (52) 
71-2 (25) 
104-9 (61) 

189-6.(45) 
141-7 (46) 
136-3 (40) 
74-8 (45) 
71-9 (40) 
104-3 (40) 

188-8 (139) 
142-4 (141) 
129-7 (118) 
75-4 (132) 
68-6 (116) 
109-8 (117) 

186-1 (137) 
140-7 (136) 
132-0 (122) 
74-3 (131) 
70-0 (120) 
106-6 (122) 

189-2 (44) 
143-0 (46) 
129-8 (34) 
75-6 (42) 
68-4 (31) 
110-2 (34) 

188-8 (54) 
142-1 (64) 
133-6 (52) 
76-3 (54) 
70-9 (52) 
106-4 (52) 

188-2 (524) 
139-1 (524) 
132-9 (521) 
75-0 (524) 
70-7 (521) 
104-9 (521) 


representing the Iron Age population are obviously required, but the data 
available clearly focus attention on mean differences between the absolute and 
relative magnitudes of the calvarial height. It can be seen from the distribution 
on p. 301 above that all the male skulls from Maiden Castle have height-length 
indices greater than 69-0. In Table IV there are two seventeenth-century London 
series with means for the index less than this value, and one of the Romano- 
British series has a mean of 69-4. These conditions indicate unusual separation 
of the distributions for different series, and there is no reason to suspect that any 
cranial characters other than the height and the height indices would distinguish 
the populations represented in Table IV as effectively. 


4. Measurements of the mandibles 

Measurements of the mandibles were taken in accordance with the revised 
technique given in Biometrilca (Morant et al. 1936, Appendix), and readings for 
individual bones are not provided in the present paper. It has been found 
(Cleaver, 1937) that larger numbers of lower jaws than of crania are needed to 
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give any decisive racial comparisons, and hence the means for all the adult 
specimens of Iron Age and Romano-British date taken together are the only 
ones worth considering. They are given in Table V for totals of thirty-four male 
and twenty-five female bones, and values for Anglo-Saxon (Morant, 1926) and 
a seventeenth-century London (Cleaver, 1937) series are included for com¬ 
parative purposes. 


TABLE V 

Mean measurements of English series of mandibles 



Male 

Female. 


Anglo- 

Saxon 

Maiden 
Castle: 
Iron Age 

Farringdon Street, 
London: seven¬ 
teenth century 

Anglo- 

Saxon 

Maiden 
Castle: 
Iron Age 

Farringdon Street, 
London: seven¬ 
teenth century 

tt'l 

m 

Wr 

ZZ 

cj 
ml 
c T l 
rlV 
i>hPi 

\ 

mji 

c r h 

rl 

ML 

RL 

CL 

100 c r lilml 

100 c fCf/ml 

100 mJCqI 
100 rb'/rl 
100 Mo/c r c 

123-7 (25) 
100-4 (33) 
100-3 (27) 
45-3 (57) 
2L-7 (38) 
107-2 (31) 
77-2 (42) 
33-2 (61) 
28-1 (59) 
33-1 (40) 
27-2 (51) 
65-7 (48) 
64-0 (46) 
120°-3 (47) 
72°-0 (36) 
68°-2 (32) 
60-9 (27) 
94-4 (15) 
129-0 (32) 
51-5 (45) 
99-3 (19) 

121-5 (19) 
99-5 (26) 
99-3 (20) 
44-8 (33) 
21-0 (27) 
105-3 (24) 
79-0 (27) 

33- 1 (33) 
28-4 (23) 

34- 7 (22) 
28-2 (20) 
70-6 (28) 
66-2 (26) 

116°-5 (27) 
77°-6 (24) 
69°-8 (18) 
67-9 (22) 
93-9 (17) 
127-4 (25) 
50-8 (26) 
102-1 (18) 

117-7+ 0-53 (23) ■ 
97-7+ 0-70 (40) 
95-9 + 0-68 (29) 
43-9 + 0-24 (40) 
19-8 + 0-20 (34) 
104-1 + 0-64(34) 
74-9 + 0-40 (40) 
30-9 + 0-28 (40) 
28-2 + 0-18 (22) 
30-9 + 0-48 (12) 
24-9 + 0-45 (19) 
64-8 + 0-47 (40) 
62-2 + 0-40 (36) 
12r-7+0-60 (40) 
72°-0 + 0-94 ( 37) 
61°-8 + l-36 (16) 
62-4+0-66 (34) 
92-3+1-03 (25) 
130-9+1-23 (40) 
50-0 + 0-54 (38) 
102-9 +1-07 (29) 

116-0 122) 

92- 9 (35) 

93- 2 (28) 
44-1 (50) 
19-1 (35) 

104-2 (45) 
74-6 (49) 
31-0 (66) 
27-6 (57) 
30-5 (31) 
24-4 (52) 
59-2 (47) 
59-1 (45) 
122°-5 (49) 
68°-2 (36) 
70°-0 (25) 
58-3 (38) 
91-7 (26) 
126-2 (35) 
53-0 (43) 
99-3 (23) 

117-5 (12) 
90-8 (18) 

95- 9 (18) 
42-5 (24) 
20-0 (14) 
98-9 (16) 
70-5 (19) 
30-4 (24) 
27-2 (19) 
30-5 (17) 
26-6 (15) 
60-2 (25) 
57-3 (20) 

123°-1 (19) 
71°-6 (18) 
70°-3 (14) 
60-5 (16) 
97-1 (12) 
129-2(18) 
53-5 (19) 

96- 2 (14) 

110-8 + 0-64 (30) 
85-7 ±0-55 (50) 

91- 7+0-54 (35) 
42-9±0-25 (49) 
18-0 + 0-16(45) 
99-4 + 0-81 (43) 
69-7 ±0-37 (50) 

28- 3 ±0-25 (50) 
27-7 ±0-22 (19) 

29- 7+0-36 (26) 
23-6+0-51 (12) 
56-5 + 0-47 (50) 
63-5+0-44 (43) 

127°-8±0-59 (50) 
71°-2 + 0-87 (48) 
07 o -3 + 0-80 ( 27) 
56-8 + 0-49 (43) 

92- 4 ±0-74 (30) 
123-3 + 1-02 (50) 

53-2 + 0-66 (43) 
94-0 + 0-86 (36) 


None of these is long enough to give an adequate representation of the type 
for the population it represents, and probable errors are only available for the 
last, A few general comparisons are sufficiently suggestive, however, to be of 
interest. For both sexes the two earlier types are very similar in size, and for 
several characters both appear to be significantly larger than that of the 
Farringdon Street series. There is only one measurement of size which provides a 
clear exception to this rule—viz, the length of the dental arcade from second 
molar to first premolar (mppf —and it is the only measurement taken relating to 
the size of the teeth. Both Anglo-Saxon and Iron Age types appear to be 
distinguished from that of seventeenth-century Londoners on account of less 
protruding chins [C'L greater), and the Iron Age appears to be distinguished 
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from the other two on account of a more outstanding ooronoid process (judging 
from R.L and 100 c r h/ml). Otherwise, there is no clear evidence of distinctions 
between the types. 

Judging from all the characters, the Anglo-Saxon and Maiden Castle popu¬ 
lations were just distinguished by features of their lower jaws, but the resemblance 
between them was closer than that between either and the later Londoners. The 
relations found favour the hypothesis that the mandibles of Englishmen, but 
not their teeth, became slightly smaller in historical times, but more data will be 
needed to substantiate it. If the hypothesis be accepted, then mandible measure¬ 
ments should not be used to estimate racial relationships unless allowance is 
made for secular changes in them within the same population. 


5. Measurements oe the lengths oe the long bones 

Individual readings are given in Table VIII, and means of the'adult series for 
the characters of greatest interest are in Table VI.. The lengths were taken in the 
ways specified by Munter (1936), whose means for Anglo-Saxon skeletons are 
quoted. In spite of the small numbers, it was thought worth while computing 
separate means for (a) the Belgic War Cemetery series, and ( b ) for all the other 
specimens of Iron Age or Roman date from Maiden Castle. 

For both sexes all the absolute measurements for these two series in Table VI 
are less than the corresponding values for Anglo-Saxons. A rough appreciation 
of the significance of the differences between the means can be obtained by 
supposing that the two Maiden Castle populations considered, and also that 
made up by combining them, exhibited the same variabilities as the total 
Anglo-Saxon population. This may appear to be a very arbitrary assumption, 
but in fact it is not unreasonable, since a close approach to equality in variation 
is usually found on comparing different subgroups with a total population, and 
also on comparing distinct populations. 

By applying the Anglo-Saxon standard deviations, we reach the conclusion 
that there are no significant differences between the means of the absolute and 
indicia! measurements in the case of the Belgic War Cemetery and the other 
series from the same site, and this is true for both sexes. On the same assump¬ 
tion with regard to variation, it is found that the pooled Maiden Castle means 
only show markedly significant differences from the Anglo-Saxon in the case of 
the length of the femur for males, the length of the humerus for males and 
females, and the radius-humerus index for males. The differences between the 
statures cannot be supposed clearly significant in the case of either sex, but it is 
safe to conclude from the evidence of both that the Iron Age inhabitants of 
Maiden Castle were shorter than Anglo-Saxons. The sex ratios for stature are 
almost identical, being 1-076 for the former and 1-073 for the latter series, and 
of the order usually found. In inches the estimates of height obtained are 
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TABLE VI 

Means for length of the right femur, tibia, humerus and radius, and indices derived 
from these lengths, for Maiden Castle (Iron Age and Romano-British) and 
Anglo-Saxon series of adult skeletons 


■ 

Male 

Other than 
Bolgic War 
Cemetery 

Bclgic War 
Cemetery 

All Maiden 
Castle 

Anglo-Saxon 

F. max, 

T, max.* 

H. max. 

R. max. 

100 T. obl./F. obi. 

100 H, max./F. obi. 

100 ft, max./H. max. 

100(H.max.+R. max.)/(F. obl.+T. max.)f 
Reconstructed stature 

437-6 (11} 
302-8 (11) 
323-0 (12) 
240-8 (10) 
81-2(10) 
73-9 (11) 
76-4(10) 
71-2 (9) 
1040 (13) 

443-2 (15) 
371-2 (18) 
320-8 (17) 
250-9 (16) 
82-5 (14) 
74-3 (13) 
77-2(15) 
71-0(12) 
1054 (19) 

440-8 (26) 
368-0 (29) 
325-2 (29) 
246-9 (25) 
82-0 (24) 
74-1 (24) 
7G-4 (25) 
71-5 (21) 
1649 (32) 

463-3 ±1-22 (153) 
378-9 + 1-46 (103) 
337-1 + 1-15 (121) 
251-6+1-12 (79) 

81-1 + 0-17 (92) 

73- 5 + 0-14 (100) 

74- 6+0-19 (62) 

70-7 + 0-16 (41) 

1683 (161)j: 

Female 

F. max. 

T. max.* 

H. max. 

R. max. 

100 T. obl./F. obi. 

100 H, max./F. obi. 

100 R. max./H. max. 

100 (H.max.+R, max.)/(F. obl.+T. max.)t 
Reconstructed stature 


411-0 (8) 
842-7 (7) 
290-4 (8) 
215-1 (7) 
81-2 (0) 

72- 3 (7) 

73- 4 (7) 
09-7 (5) 

1527 (9) 

411-5 (21) 
338-6 (21) 
298-9 (19) 
220-3 (20) 
81-4(19) 
73-5 (15) 
73-8 (16) 
70-3 (12) 
1532 (26) 

426-1 + 2-05 (66) 
350-4+1-99 (44) 
312-6 + 1-54(47) 
227-6+1-41 (34) 
80-8 + 0-20(38) 
73-6 + 0-25(36) 
73-5 + 0-22(31) 
70-6+0-21 (21) 
1668 (69)} 


* Maximum length of the tibia including the spine. f Maximum length of the tibia excluding the spine, 
l The reconstructed statures for Anglo-Saxons were found from the mean lengths for different long bones, 
and tile numbers of individuals given are the numbers of femora involved, The statures actually relate to rather 
larger numbers of skeletons. 


5 ft. 6| in. for Anglo-Saxon and 5 ft. 5 in. for Iron Age men, and 5 ft. 1| in. for 
Anglo-Saxon and 5 ft. 0J in. for Iron Age women. The average stature of the 
general male adult population of England to-day is about 5 ft. 7| in., and for 
different social classes means between about 5 ft, 5f in. and 5 ft, 9| hi. are 
found. The Iron Age men were thus decidedly short compared with modern 
Englishmen, and the estimated stature for them is very close to the average 
found for all European populations to-day. 

For the measurements considered, the only distinction between the two 
ancient populations other than that in size is found for the index expressing the 
length of the radius as a percentage of the length of the humerus in the case of 
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the male, but not in that of the female, sample. It might be suggested that the 
proportions of the upper and fore-arms of the men who lived at Maiden Castle 
were influenced on the right, or on both, sides by continual practice in the use 
of the sling, which is known to have been one of their principal weapons from 
the evidence of caches of stones. The following means, which relate only to 
paired indices, are of interest in this connexion: 



100 x Radius max./Humerus 

max. 

100 x (H. max. + R. max.)/ 

(F. obi. +T, max. ex spine) 


Maiden Castle 

Anglo-Saxon 

Maiden Castle 

Anglo-Saxon 


Male 

Female 







R. 

L. 

R.-L. 

76-3 (18) 
78-4 (18) 
-2-1 

73 e (14) 
75'0 (14) 
-14 

746 (25) 
75-7 (25) 
-H 

n 

71-5 (11) 
70-8 (11) 
+0-7' 

70-3 (12) 
68-9 (12) 
-h 1*4 

70-9 (21) 
70-0 (21) 
+0-9 



All the series are very restricted in size, but there is complete agreement in 
the signs of the side and sex differences found. The most significant difference 
between the two populations appears to be that for the male radius-humerus 
index on the left side. This might suggest that the Maiden Castle men were left- 
handed slingers, but their intermembral indices are so close to the Anglo-Saxon 
values that it seems unsafe to accept the hypothesis that the lengths of their 
arms were affected by use. More abundant material might tell definitely for or 
against such a view, and a detailed anatomical examination of the arm bones of 
the Maiden Castle and modern series would also be relevant to the question. 

6. Summary and conclusions 

The material studied consists of the imperfect skeletal remains of eighty- 
three individuals, who are distributed in Table I according to periods, sexes, 
and ages at death. This paper is concerned chiefly with the customary measure¬ 
ments of the crania and mandibles, and with the lengths of the long bones. 
Comparisons of various kinds are made between: 

A , the Belgic War Cemetery series, representing the defenders of the Castle 
who were massacred by Roman invaders in a.d. 43, and 

B, a composite series made up by all the other specimens of Iron Age and 
Roman date. 

There is no clear distinction between the distributions of the ages at deaths 
of the adults forming the two short series, but for both the men died at a rather 
younger age, on the average, than those interred in English cemeteries of later 
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date, while the females are not distinguished in the same way. Remarks on 
cranial anomalies are provided. The teeth of the inhabitants of Maiden Castle 
were not well preserved. 

Judging from their appearance, nearly all the crania in series A and B would 
be considered quite unexceptional if found in any British collection of later date 
than the Bronze Age, but there are few examples of the markedly retreating 
frontal bone which characterizes seventeenth-century Londoners. There is 
agreement between the male and female samples and this is confirmed by 
measurements of the skulls and long bones. 

There are no statistically significant differences between the mean cranial 
(Table III) and long bone (Table VI) measurements of series A and B, Both are 
characterized by a large calvarial height, and the estimates of stature they give 
are about 1 in, less than the Anglo-Saxon values. The type of the total series 
(A+B) of crania is found to be indistinguishable from the Anglo-Saxon, while 
both are differentiated from the types of seventeenth-century London series. 
This conflicts with the conclusion reached previously, from very inadequate data, 
for Iron Age and Romano-British crania, to the effect that the Iron Age and 
recent types are very similar while the Anglo-Saxon stands apart from both. 
More abundant material representing the Iron Age and Romano-British popula¬ 
tions will be required to disclose the relationships of these closely allied groups. 
The cephalic index is practically constant for them, and the cranial types are 
distinguished most clearly by differences in the absolute and relative magnitude 
of the calvarial height (Table IV). 

Measurements of the mandibles make a slight distinction between the 
Maiden Castle and Anglo-Saxon series, while both types are larger than that of 
seventeenth-century Londoners. The Iron Age men, but not the women, are 
distinguished from Anglo-Saxons by having a larger radius-humerus index. It 
is not clear that this difference is due to the fact that the men at Maiden Castle 
were slingers. 

APPENDIX 

Tables (VII and VIII) of individual measurements and remarks 

The Letters denoting periods (or groups) givon in tho third columnH of the tables are: 
N. =Neolithic, I.A. = Iron Age (A, B or C), S, = Saxon, R.-B. = Romano-British, B. = Belgic, 
B.W.C. = Bolgic War Cemetery. The letters denoting cranial measurements aro those used 
in all craniometric papers in Biometrika. A list of thorn is givon in tho present volume, 
p. 162, but this does not include B"=maximum frontal breadth (Martin, No. 10), and 
Bicist. R.=biasterionic breadth (M, 12). Owing to their fragility, the capacities of the crania 
could not he determined by any direct method. Tho estimates given in Table VII were 
obtained by applying the reconstruction formulae involving L, B and II' given by Hooke 
(1926, p. 33). The reconstructed statures in Table VIII were obtained by applying the 
formulae given by Pearson (1898), using as many as possible of tho bones involved in each 
case. The lengths of the long bones were determined in the ways adopted by Miinter (1936). 
All the readings given in the two tables, whether queried or not, can be considered close ap¬ 
proximations to the true values, with the possible exception of the few measurements of 
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theNeolithic skeletons enclosed in square brackets. Those are more uncertain than thoothors, 
but they are given because the specimens are of particular interest. 

Unless otherwise stated in the remarks given in Table VII, the cranium and mandible 
are complete, or almost complete, with both dental arcades intact and no teeth lost from 
either jaw before death, and the coronal, sagittal and lambdoid sutures are open. Absence of 
a third molar denotes that the tooth had never erupted, as far as can be told. Two male skulls 
are not in the tables, since no measurements can bo given for them, but they were included 
in determining the frequencies of qualitative features. These are: 

P 24 (B.W.C.). Sagittal suture closed, coronal and lambdoid beginning to close. Upper 
jaw incomplete and some teeth lost before death, apparently only 1 incisor R.; 2 teeth lost 
from mandible and M 3’s absent, abscess cavities at roots of M l’s. 

It 2 (?I.A.). Sagittal suture closing. Upper jaw missing. Superficial wound on frontal 
bone. 


LIST OF PLATES 

Plate I. A typical male skull (P 30) from the Belgic War Cemetery. This specimen has a cephalic 
index which is very close to the mean for the series, but its height-length index (70-9) is below 
the average (73'4) and it is peculiarly orthognathous (PZ.=90°). The lower incisors are crowded. 

Plate II. Exceptional skulls from the Belgic War Cemetery. 

A. A female adolescent cranium (P 20) with injury to the right malar hone. 

B. A male cranium (P 7 A) with a trace of a suture on the right parietal bone. 

C and D. A female skull (P 36) with an exceptional type of facial skeleton and high cephalic 
index (87-0). 

Plate III. Skulls from Maiden Castle with dental and other anomalies. 

A. A female cranium (T 21) with cyst cavities at the roots of the second premolara. 

B. The broken basi-ocoipital part of a male cranium (0 3) showing the extension into it of 
sphenoidal air Binusea. 

C. The palate of a male cranium (P 7) with a large anterior palatine foramen. 

D. The palate of a female cranium (T 28) showing absence of the canines and rotation of the 
first premolars. 

E. A female mandible ( P 14) with the left third molar impacted and mandibular torus. 
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Plate II 



A. P 20, female. Injury to right malar bone. B. I.' 7 A, male. Trace of an turn on right parietal bone. 




Exceptional skulls from the Belgic War Cemetery. 
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B. 0 3, male. Broken basi-ocoipital showing the 
extension into it of sphenoidal air sinuses. 


A. T 21, female. Cyst cavities at roots of second premolars. 


C. P 7, male. Large anterior palatine foramen. 


E. P 14, female. Left third molar impacted and 
mandibular torus. 


B. T 28, female. Absence of canines and first premolars 
rotated. 


Skulls from Maiden Castle with dental and other anomalies 



HOMOGENEITY OE RESULTS IN TESTING SAMPLES 
FROM POISSON SERIES 

WITH AN APPLICATION TO TESTING CLOVER SEED FOR DODDER 


By J. PRZYBOROWSKI and H. WILENSKI 
Plant Breeding and Agricultural Experimentation Department, 

University of Krakow, Poland 

1. Introductory 

In our previous paper, “Statistical principles of routine work in testing clover 
seed for dodder” (Przyborowski & Wilenski, 1935), we justified the assumption 
that the number of dodder seeds in samples of clover does follow the Poisson 
Law: x 

P(») = e ~ M 7T (* = 0,1,2,...)- (1) 

Thus we constructed rules which should be followed in sampling problems 
where that Law holds good. We analysed particularly the application of the 
rules to the routine work in testing clover seed for dodder. 

The purpose of this paper is to present some theoretical considerations 
relating to the problem of homogeneity of results in testing samples drawn 
from Poisson series which we have found arising in the course of our work on 
testing clover seed. 

Let % and m a denote the parameters of the Poisson distributions of the 
variables x x and x i ; the problem to be considered consists in testing the hypo¬ 
thesis J/ 0 (%=wi 2 =m), that the values x x and were obtained in sampling 
from Poisson series, the parameters of which have a common but unspecified 
value, say m. 

The method of testing statistical hypotheses as developed by Neyman & 
Pearson (1933) consists in selecting a rule of rejecting the hypothesis in question, 
whenever the sample point E (the co-ordinates of which in the ^-dimensional 
sample space W are the data of observation x v a? a ,..., x n ) lies within the 
so-called critical region, say w, of the sample space W. The probability, 
P{Eew | if 0 } = a, of rejecting the hypothesis tested, when it is true, is called 
the size of the corresponding critical region w on which the test is based. The 
errors thus committed when rejecting the true hypothesis # 0 , are called the 
errors of the first kind. The probability P{Eew | E'} of rejecting the hypothesis. 
//„ when an alternative IP is true has been termed the power of the test with 
regard to H', P{Eew | H'} considered as a function of H', where E' is any 
hypothesis belonging to the class Q of alternatives to J/ 0 , is called the power 
function of the test. The error committed when we fail to reject H 0 although 
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The hypothesis to be tested, H 0 , is that m x - m 2 or that p = \; we shall 
suppose that the admissible alternatives to H 0 are both that m x >m 2 and 
n h < m 2 or that p<\ and p>\. Since H 0 does not specify the value of p (which 
may have any value between 0 and oo), it is a composite hypothesis in the 



Fig. 1. Critical region for case a=0-05. It consists of the two areas containing the larger dots. 

sense of Neyman & Pearson. To test it, we should like to be able to use a critical 
region, w, such that 

P{Eew\p = l,p} = a, .(9) 

whatever be p. No such region can be found, owing to the discontinuous 
character of the distribution of the variables, but it is possible to go a long way 
in determining a satisfactory region on the following lines. 

It will be noted that if p = \, then the distribution of x x on each line, 
n = constant, is that of the symmetrical binomial + If p is either > | 
or < we shall have a skew binomial with probability density concentrating 
towards the lower or upper end of the line; in order to make the test as efficient 
as possible in detecting departures from p = \ in either direction, the critical or 
rejection region should therefore include the two tails of each of the separate 
binomials It may be defined as follows. 




Buildup the critical region r«4 " t Hr,, -t m! *h*- hn«-* 

n = ,r, '-x 2 - rmirtanl jm * 1 . 1 , 2 , ... i, .(| 0 ) 

where w(n, a) include* all wnnple «m * !**? I*«t 

,r, •: k(n, x'j and ,r, - n *?’>, 5":. .(||) 

/r(«, a) being a jaiaifive integer dHrmiiiwd fb«< 

(a) the sum of the terms of tin* binomial t{ t if' • «irr<'*jfMj»i)iMg to 

It, 1, ... if n, ,* lit* ' | *, 

(/?) the aunt of the terms rorrvajuiiding i»» 

t), 1, ,,,, i'fii, kin, fi * J r« * • t 

The piece* «•(#, at) ami part ofthe roiiipHe rrgi»»n »»• !<•* *I»» • .w> x ■■ (i(Ci are 
indicated in Fig. 1. In view of the form *«f the **!*«y law Hi) it will be 

seen that tin* region »r is such that 

VlEtJf | p | ,pl' t, -fill) 

whatever be /<. Thun if we reject the ttyi*>«tI******** //„ who h wwim** nt,u 
whenever the sample point rtmeluded in -<r, t<* fall« »* »ny «4 the points im 
dicatedin Fig. Ihy tin* larger wehrem that tb*«f»>4. «4 the find kind of error 

is at most etpial to a. Further, while it numnf l*»* rhune^i ?J»«t tlmtet i« com¬ 
pletely unbiased or in the uniformly most {xmerfoS w*l*MM*e4 ,* it term* likely 
to be a» efficient a* any other nltortiHtivr ie»t hi »1h*-i ting *!cjw*t«mwmpfrum \, 
In Table I we have given the Imimdary va1iu«* t»n. *'j «4 thr regions 
associated with four value* of x, v»/ H 2tt. Hi ft, «■«*;*, >H»i. *nd t*r » *4*,+^ 
varying from 0 to 8ft. Having derided on the »ppmj-»rii*t«« ).»r x, dm rule 
of the test is: reject the hypothec that »», ■ m t if 

*1 fir r,?» kin 

lb will be seen that for small value* of » no leaf «**»*■ iatr*! ■** sti» flic «rmpiired 
may be possible. 

The power function of the teat, which dcjirii*!* on l*»»<h p and p, may ho 
calculated from the expression 


WM-sr,' v 

*-al n, ww 


. jj.i’i'fa x. f 


1 V 


03} 


where is the sum of the terms of tin* lunomnd rxi«iuus*i**sn 
nils, <o 


satisfying the conditions (11 j. ' * ' 

Values of the power for film two mm at Hi and jc ««:*, and fora number 
of values of /i and p are given in Table II a and h, Ths* mni|*wi<siwtt involved 
(a) first finding for a given n the sums of the bin* imiftl ternif in flu* wwfld 
summation of (13) with the help of the Tables of the Jmw* tkut ftttwfee, 

* These terms have been applied to »I * amM». 
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(1034), and (b) then multiplying this by the appropriate term of the Poisson 
series, and (c) finally summing for n = 0,1, 2,.... 

It will be noted that if the admissible alternatives to m 1 = m 2 are only that 
fl h> m z> *hen the appropriate critical region will be defined by x 1 ^n-k(n, a), 
and the risk of the first kind of error will now be < \ct. Similarly, if the alter¬ 
natives are only m x < m 2 , the region will be defined by x L < k(n, a). These regions 
correspond to the lower and upper marked areas, respectively, of Fig. 1. 

We have also given in Tables II in certain cases the actual value of 
P{Eew | p = \,p), that is to say, the chance of rejecting the hypothesis that 
m x — m 2 when it is true. These chances are, as expected, considerably less than 
the corresponding values of a, but approach a as p increases and the dis¬ 
continuity in the distributions of x x and x 2 becomes of less importance. In 
practice we shall not, of course, know exactly what p is, but as we shall probably 
have a rough idea of the size of the Poisson m in the particular problems 
investigated, the figures in these columns of the Tables will probably be helpful 
in determining the value to select for a, the upper limit of the significance level. 
Table III contains true significance levels for the other two limiting values, 
a = O'20 and a = 0-01, for which the rejection levels k(n, a) have been given 
in Table I. 

From the data of Tables II it was possible to construct the charts given in 
Figs. 2 and 3 (pp, 322, 323 below), showing in terms of the Poisson parameters 
m 1 and w 2 (rather than p and p ) some of the contours of equal power. 

3, Discussion of tables and charts with some 

ILLUSTRATIVE EXAMPLES 

A point which is clearly brought out by the present investigation is that it 
is impossible to detect differences between the means of two Poisson series 
mj and m 2 , when both these quantities are small, even if one is several times 
larger than the other. Thus even were m 1 = 0 and m 2 - 5, the chance of 
detecting the difference from the samples would be only 0-384 using a = 0-05, 
and 0-560 using a = 0-10.* As an illustration of the kind of differences that may 
be detected, Fig. 2 may be used as follows. It might with reason.be regarded 
as undesirable to plan an experiment in which the chance was less than 0-5 of 
detecting from two random samples that differences between m x and m 2 of 
practical importance existed. If we pass along the 50 % power contour in the 
diagram, we pass through points (m lt m 2 ) of which the following are typical: 

(0-0, 4-7), (2-0, 8-1), (6-0, 14-3), (10-0, 19-8), (18-0, 30-6). 

If it is important to be able to detect differences of this magnitude, then larger 
samples must be taken in order to increase the expectations. If the samples 

* The true values of the significance level in these cases would be 0-09 and 0-024 respectively; 
if the. test were used with a = 0-20 (and a- significance level in this case of 0-065) the chance of 
detection would be somewhat greater. 
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can, for instance, be increased c-fold, the expectations will be increased to 
m v c%, a point lying on the same straight line passing through the origin as 
m x , wijj, but placed c times further out. Since such radial lines will cut the 
contours of equal power at rather acute angles, considerable multiplication of 
sample sizes may often be needed to ensure detection of differences. 

Regarding a chance of detection of 19 to 1 as satisfactory, we note that the 
95 % contour in Pig. 2 passes through points {m v m 2 ) of which the following 
are typical: 

(0-0, 9-1), (2-0, 15-5), (6-0, 24-0), (10 0, 31-5). 

Differences of this kind, if they exist, can therefore be almost certainly detected 
using in the test the critical values associated with a = (MO. Rather smaller 
differences could be detected with the same high probability if a were taken 
as 0-20. 

In practice, of course, expected values m x and m 2 are not known in advance, 
but we believe that reasonings of this kind based on the tables or charts may 
be useful as a preliminary step in planning the extent of sampling required 
either in experimental work or routine analysis. 

Example. In the testing of clover seed for dodder it is customary to withdraw 
a 100 gr. sample from a sack with a long trier or probe. Clearly the number 
of dodder seeds found will vary from sample to sample, and from one sack to 
another in the same consignment. This may be due to the following causes: 

(1) The material from which the samples were drawn was not thoroughly 
mixed. 

(2) The two samples were drawn from different seed stocks. 

(3) There are laboratory errors in analysis. 

(4) The existence of random sampling fluctuations. 

In applying statistical analysis in the comparison of two counts, the 
hypothesis to be tested assumes that the discrepancy to be tested is merely 
due to chance, and that the frequency of dodder seeds, x, will vary from one 
sample to another in accordance with the Poisson law. If a significant difference 
is found, it may of course be due to one of the first three sources of error. 

Suppose that a 100 gr. sample is drawn from a sealed sack of clover and 
sent to a seed testing station, and that no dodder seeds are found in it, i.e. 

= 0. The buyer, before opening the sack, takes another 100 gr. sample, and 
on having it analysed learns that three seeds have been found, i.e. a: 2 = 3. 
Would he have any justification for criticizing the first testing on the grounds 
that the difference between 0 and 3 could not be due to chance? On examining 
Table I, it is seen that for n=x 1 +x 2 = 3, even using the least stringent of the 
four tests, the difference cannot be regarded as significant.* 

Suppose that on another occasion it was found that ■ x 1 = 2 and a; 2 = 6, 

* It will be noted from Table III that with p. equal to 5 or less, the true significance level for 
the first test with a = 0-20 is at about 006. 
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Entering Table I with n = 8, we see that for none of the levels of test does 
x i — 2 fall in the critical region. If the purchaser still considered that to find 
three times as many dodder seeds in one sample as in the other could not be 
due to chance, we might illustrate the danger of drawing conclusions in this 
way by a use of one of the charts. If one of the expected numbers were three 
times the other, so that m 2 = 3wq, and therefore p = 0-25, we may ask how large 
H = wq+m 2 must be before there is as much as an even chance of detecting the 
difference. Taking Pig. 3, it is found that the line m 2 = 3?% cuts the 50 % power 
contour at about the point m 2 = 13-8, m 1 =4-6. Thus samples of clover at least 
twice as large as those taken would be needed to provide only a 50: 50 chance 
of detecting a real difference in dodder content of the order suspected by the 
purchaser. In this case the true significance level of the test used is seen from 
the number on the diagonal line of the chart to be about 0-03. 

To be almost certain of detecting a difference with m 2 = 3%, for example 
to make the odds 19 to 1, we notice that a continuation of the same line outs 
the 95 % power contour where m 2 = 38-4, m 1 = 12-8. For this samples of about 
600 gr. would be needed. 


4. Summary 

The problem considered is that in which x x and x 2 are two independent 
random variables distributed in accordance with the Poisson law of equation (1), 
and it is desired to test the hypothesis that the expectations m 1 and m 2 are the 
same. We have shown how a test may be derived which is independent of the 
value of the unknown common hypothetical expectation but which, owing to 
the discontinuous nature of the probability distributions, will only provide an 
upper limit to the significance level, i.e. to the chance of rejecting the hypothesis 
tested when it is true. The manner of approach of the significance level to its 
upper limit has, however, been investigated numerically. 

A table (Table I) has been provided, containing critical values k(n, a) 
required in carrying out the test. The power function of the test has also been 
determined, and tables (Table II a, b) and charts (Figs. 2, 3) given which make 
it possible to determine the chance of detecting differences in the expectations 
wq and m 2 of specified magnitudes. Finally, a discussion of some uses of the 
test has been added. 

In conclusion we should like to express our thanks to Prof. E, S. Pearson 
for his helpful suggestions and criticism made during the course of our in¬ 
vestigation.* 

* [Certain modifications and additions to the paper have been made since it was received for 
publication at the beginning of July 1939. As circumstances have unfortunately made com¬ 
munication with the authors impossible, I must accept responsibility for these alterations, which 
have been mainly concerned with the extension of Tables X and II to higher values of n and fi. 
E. S. P.] 
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TABLE I 


Boundary values, k{n, a), for each n 


n 

Upper limit of significance level, a 

n 

Upper limit of significance level, % 
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1 
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19 
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15 
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20 
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26 

25 

22 

28 

10 
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8 

0 
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8 
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27 

25 

23 
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10 
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7 
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29 
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20 

23 
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11 

10 

9 
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28 

26 

24 
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27 
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12 
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Rule of test: reject hypothesis that m l - m t if x^kin, a) or x{zn~k{n, a). 


See note on p. 323 regarding extension of limits for n>80. 
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Note regarding Table I. When x l and x l are large, if the' hypothesis tested is true then 
[x 1 —x i )j'\/{x L +x i ) may be regarded as a unit normal deviate or, from another point of view, iq 
should be normally distributed about \n with standaxd deviation %. On this basis, in the 
ease of » = 80, the values of k{n, a) for a=0-20, 0-10, 005 and 0 01 are found to be 34-3, 32-6, 
31.2 and 28*5, respectively. Applying a correction for discontinuity we have 33, 32,30 and 28 for 
the integral values of k(n, a) associated with upper limits of significance level equal to the four 
values of a. These are the values for k(n, *) given in the last row of Table I. It is clear that 
beyond n = 80 the normal approximation is valid. 











ON THE METHOD OF PATRED COMPARISONS 
By M. G. KENDALL and B. BABINGTON SMITH 


Introduction 


1. Suppose we have a number of objects A , B, 0, etc. which are to be 
considered according to the different degrees in which they exhibit some 
common quality. If the quality is measurable in some objective way the 
objects will yield a number of variate values, in which case the problem is 
amenable to treatment by well-understood methods. It may, however, happen 
either for theoretical or for practical reasons that the quality is not measurable. 
We then have to rely for a discussion of the variation of the quality on judgments 
of a more or less subjective kind carried out after a comparison of the objects 
among themselves. 

One of the methods of comparison which has been widely used in this 
connexion is that of ranking. An observer examines the objects and arranges 
them in the order in which he judges them to possess the quality under con¬ 
sideration. This arrangement iscalled a ranking, and when two or more observers 
provide rankings of the same set of objects there arise the familiar questions of 
the type: is there any significant resemblance between the judgments of 
observers? or, do the data furnish any evidence that the objects have a “real” 
objective ranking? 

2. The ranking method suffers from a serious drawback when the quality 
considered is not known with certainty to be representable by a linear variable. 
We may, for instance, ask an observer to rank a number of individuals in order 
of intelligence, and he may comply with the request in the full belief that he 
is doing something within his powers; but if intelligence is not measurable on 
a linear scale this ranking may fail to give a real picture either of the observer’s 
preference or of the variation of intelligence among the individuals. It is not 
impossible that the observer should judge A more intelligent than B, B than 0, 
and G than A, if the individuals are presented for his consideration one pair at 
a time. The likelihood of this happening is obviously increased when we are 
dealing with tastes in music, eatables or film stars; and in practice the event 
is not uncommon. Such “inconsistent” preferences can never appear in ranking, 
for if A is preferred to B and B to C, then A must automatically be shown as 
preferred to G, The use of ranking thus destroys what may be valuable infor¬ 
mation about preferences. 

■3. In this paper we consider a more general method of investigating pre¬ 


ferences. With n objects, we shall suppose that each of the 


n 

2 


possible pairs 


is presented to an observer and his preference of one member of the pair noted. 
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We assume that a choice between two objects can always be made.* With 
m observers the data then comprise m j preferences. The questions to be 


discussed include: 

(a) Is there any evidence that a particular observer is capable of forming 
a reliable judgment of the quality under investigation; and if not, is the fault 
his, or is it due to the fact that he has been asked to perform an impossible task? 

(b) Is there any significant concordance of preferences between observers? 

(c) Gan the quality under discussion be represented by a linear variable? 

4. The method of offering for judgment objects two at a time is known as 
the method of paired comparisons. Hitherto it has been used mainly in human 
psychology, but it has some interesting applications in animal experimentation. 
For instance, in feeding experiments it is impossible to get an animal to rank 
a number of foods in order of preference but it, is not difficult to offer pairs of 
foods and to note which is taken first. Experiments of this kind have, of course, 
to be conducted with great care to ensure that conditions operating when the 
different pairs are offered are as constant as possible; but the difficulties are 
far from being insuperable and the method of paired comparisons offers a useful 
technique in cases where the more usual procedures cannot be applied. From 
the point of view of theoretical statistics perhaps the most interesting part of 
the present work is that it offers some lines of approach to the difficult question 
whether a given quality can be legitimately regarded as based on a linear 
variable, i.e. whether ranking or scoring methods are justifiable or not. 


Consistence in preferences 


If the object A is preferred to B we write A->B or B+-A. The 



preferences of a single observer may be represented in tabular form as shown 
in Table I. 

In this table, which is shown for the six objects A to F, an entry of unity 
in column Y and row X means X-,- Y, and is thus accompanied by a comple¬ 
mentary zero in row Y and column X, The diagonals are blocked out. For 
example, in Table I, A-^B, A-+C, D->A, etc. 

The arrangement of the objects A to F in the row and column headings is 
quite arbitrary. There are (n !) 2 ways of representing the same configuration of 
preferences in such a table according to the permutations of objects in row and 


* That is, we exclude cases in which an observer cannot make up his mind which object lie 
prefers, just as in the ranking case one excludes the possibility of split ranks. In practice it some¬ 
times happens that an observer is genuinely unable to reach a decision. To .allow for this fact in 
the theoretical discussions would introduce complications of a most intractable kind. When the 
effect becomes important in practice it can be allowed for by selecting the set of preferences which 
are most unfavourable to the hypothesis under test. 
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TABLE I 



A 

B 

0 

D 

E 

F 

/t 

— 

1 


0 

1 

1 

B 

0 

_. 

0 

1 

1 

0 

U 

0 

1 

— 

1 

1 

1 

D 

1 

t) 

0 

— 

0 

0 

E 

0 

0 

0 

1 

-- 

1 

F 


B 

0 

1 

0 

— 


column; but in practice it is generally desirable to have the order in row and 
column the same, and even among the n ! possible arrangements so given there 
are often practical considerations which determine one order as more convenient 
than others. 

6. Paired comparisons may also be represented geometrically by a method 
which can be illustrated for the case of the six objects as follows: 



lig. 1. Geometrical representations of the scheme of preferences of Table I. 

We represent the six objects A to F by the six vertices of a regular hexagon 
and join the vertices in all possible ways by straight lines. If A-^B we draw 
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an arrow on the line AB pointing from A to B. The arrows shown on Fig. 1 
correspond to the preferences shown in Table I.* 

7. If an observer makes preferences of type A-+B-+C-+A we say that the 
triad ABO is inconsistent. In the geometrical representation an inconsistent 
triad is shown by a triangle in which all the arrows go round in the same 
direction. We may thus speak of a “circular” triad of preferences. In Fig. 1 
the triads ACD, BEE and three others are circular. 

It is also possible to have inconsistent triads of greater extent; but any 
such circuit must contain at least two circular triads. Suppose, for instance, 
that ABOD is circular, e.g. that A. B > 0D yA . Then either A _ ^ 0 or O^A. 
In the first case ACD is circular, in the second ABO. Similarly either ABD or 
BOD is circular. Thus the circular tetrad must contain just two circular triads. 
On the other hand it is possible for a tetrad to contain circular triads without 
being itself circular. 

Similarly, if ABODE is circular either ABO or AODE is circular and either 
BOD or BDEA is circular. If the two tetrads are circular there must be at least 
three circular triads (not necessarily four, because ADE may be common to 
both). It is easy to see by an actual example based on this configuration that 
there need not be more than three circular triads; and it is clear that there must 
be at least three. For if the tetrads are not circular then ABO and BCD must 
be so and then either ODE is circular or ABCE is so, adding at least one more. 

Generally, it appears that a circular n-ad must contain at least (w — 2) 
circular triads; but it may contain more, and the fact that an n- ad contains 
(n- 2) circular triads does not mean that it is itself circular. In discussing 
inconsistences, therefore, it seems best to confine attention to circular triads, 
which, so to speak, constitute the inconsistent elements of the configuration, 
and to ignore the more ambiguous criteria associated with circular polyads of 
greater extent. 

8. We now prove the following theorems: 

(1) The maximum possible number of circular triads is [n a ~n)j 24 if n is 
odd and (a 3 - 4n)/24 if n is even; and the minimum number is zero. 

(2) These limits can always be attained by some configuration of pre¬ 
ferences. 

(3) For any integral number .between the maximum and the minimum 
there exists at least one preference-configuration with that number of circular 
triads; and in general there will be more than one. 

Consider a polygon of the type shown in Fig. 1 with n vertices. There will 

* These preferences were obtained in an experiment on a dog, which was offered the following 
foods in pairs; meat, biscuit, chocolate, apple, pear and cheese. The members of a pair were cut 
to the same size and placed equidistantly from the dog, which was then released and allowed to 
choose. All the pieces of food were eaten avidly, it being that sort of dog, but there were considerable 
inconsistences in choice. We do not offer these data as more than an illustration of the 
method. 



328 


On the Method of Paired Comparisons 


be (n- 1) lines emanating from each vertex. Let a v a 2 , a„ be the number 
of lines at the respective vertices on which the arrows leave the vertex. 

n ( n \ 

Then .S' (a,.) = M 

and the mean value of a r is [n -1 )/2. 

Define T = sloe-—-] 


a,,, n(n-l) 2 
: B[a;) -i— • 


•( 1 ) 


We now show that if the direction of a preference is altered and the effect 
is to increase the number of circular triads by d , T is reduced by 2d; and 
conversely. Consider the preference A-+B, The only triads affected by altering 
this to B->A are those containing the line AB. Suppose there are a preferences 
of type A->X (including A->B) and [1 preferences of typo B->X. Then four 
possible types of triad arise: 

say p in number 

A<-X->B, 

A->X~>B, which must number <x-p- 1 
/l A. ■<—,, ,, fi — j). 


When the preference A~>B is reversed the first two remain non-circular. 
The third becomes circular, the fourth ceases to be so, The reduction in the 
vlueofTta aMo _ 1)!+/!M/}+1)S 


•= 2(a — // — 1) 

= 2d, say. 


Tire increase in the number of circular triads is 


(a-p-1)-(/?-. p) = «-//-! 

= tl. 

More generally, if as the result of reversing any number of preferences T is 
decreased by 2d, then d must be an integer and the number of circular triads 
must be increased by d, This clearly follows from the previous results for the 
reversal of preferences can take place one at a time and the effect on T and the 
number of circular triads is cumulative. 

We now investigate the maximum, and minimum values of T, It is clear 
from the definition that T is greatest when the a’a are the natural numbers 
1,2,..., n] and this is a possible case because it corresponds to ordinary ranking. 
Hence max, ( T) = ( «, 3 -%)/ 12 , 

For the minimum value, consider the polygon A v A 2 , A n . Set up the 
preferences A^A^ Clearly at any vertex this results in one arrow 
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entering and one leaving the vertex, i.e. the contribution to a is unity at each 
vertex. Next set up the preferences A^A^A^.... This circuit may either 
visit each vertex once, or not. In the latter case we proceed to an unvisited 
vertex and set up the preferences A r -+A r+2 -*A r+i ->... and so on. Again there 
will be a unit contribution to all the a’s. 

We then set up the preferences A^A^Aj^ etc. and so on; and in this 
way we shall ultimately complete the preference scheme. 

If n is odd all the preferences described will consist of circular tours of the 
polygon, and thus the value of a for each vertex will be (n~ 1)/2. If n is even 
the last preference A^A^^ will not be a tour but will consist of the single 
line joining one vertex with the symmetrically opposite vertex. Thus there will 
be nj 2 vertices for which a = nj2 and n/ 2 vertices for which a = (n-2)/2, In 
this case T = n/4. 

Now it is clear from the definition of T that it cannot be less than zero, or 
if n is even, be less than nji. The configuration just given shows that these 
minima are, in fact, attainable. 

Thus T can vary from a maximum of [n z ~n)j 12 to a minimum of zero 
or nj 4. Hence the maximum number of circular triads, being half the variation 
from maximum to minimum of T (the maximum of T corresponding to the 
ranking case in which there are no inconsistences) is (n. 3 -4n)/24 if n is even 
and [n s ~n)l 24 if n is odd. 

This establishes the first two results enunciated at the beginning of this 
section. To prove the third it is sufficient to give a systematic method of 
proceeding from the configuration of minimum to that of maximum incon¬ 
sistence by steps decreasing T two at a time. Consider, for example, the case 
n = 8. For the minimum inconsistence the a’s will have the values 0 to 7, which 
we set out thus: 

ABCDEFGH 
0 1 2 3 4 5 6 7 

We proceed by reversing the preferences between vertices whose a-values 
differ by two. This clearly reduces T by two. 

Reversing the preferences between C and E we get 

ABCDEFGH 
0 1 3 3 3 5 6 7 

and between D and F we get 

ABCDEFGH 
013434 6 7 

which we may rearrange as 

ABCEDFGH 
0 1 3 3 4 4 6 7 
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Now reversing the preferences between B and. E and between D and Q and 
rearranging we have 

ABE GFDOH 
0 2 2 3 4 5 5 7 


and now interchanging A and B, G and Id, 

A B E 0 F D G II 
1 1 2.3 4 5 6 6 


At this stage we have preserved the a-numbers 2, 3, 4 and 5 in the middle 
but reduced the extremes A and II. We can now carry out the process again, 
arriving at the a-numbers. 

1 2 2 3 4 5 5 0 

and twice again, giving 

223344 5 5 

whence a final interchange gives 

3 3 3 3 4 4 4 4 


and this is the position of maximum inconsistence. It is readily verified by 
following the interchanges on a polygon diagram that the reversals are, in 
fact, legitimate. 


COEFFICIENT OF CONSISTENCE IN PAIRED COMPARISONS 


9. If d is the number of circular triads in an observed configuration of 
preferences we define 


2id 

- 1 n 3 ~n’ 

n odd 

24 d 

n 3 -in’ 

n even 


•( 2 ) 


and call £ the coefficient of consistence. If and only if it is unity there are no 
inconsistences in the configuration, which may therefore be represented by a 
ranking. As £ decreases to zero the inconsistence, as measured by the number 
of circular triads, increases. 

For example, in the configuration of Fig. 1 there are five circular triads, 
ABD , AGD, AFD, AED and BEF. The maximum possible number is 8. 
Thus £ = 0-375. 

10. £ can also be interpreted in the light of Table I. Suppose, in that table, 
we sum the rows. (The column sums are determined by the row sums and 
add no fresh information.) The sum of any row will be the a-number for that 
vertex in the polygon which corresponds to the object defining the row. T will 
then be the value of the sum of squares of deviations of row totals from the 
mean value (n -1)/2, that is to say, will be the variance of the row sums multiplied 
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n ' £ -* s thus a linear function of this variance; but it cannot be tested in the 
X 2 distribution as if Table I were a contingency table, for the border cells are 
not independent or linearly dependent. 


11. II an individual observer produces a configuration of preferences which 
show inconsistence there are usually several explanations; he may be an 
incompetent judge, the objects may be so alike that consistent differentiation 
is not possible, or his attention may wander during the course of the experiment. 
We discuss these questions later. They are mentioned here to explain the motive 
for the next stage of the mathematics. With what probability can a value of 
£ arise by chance if the observer allots his preferences at random with respect 
to the quality under consideration? 

With n objects there are possible configurations of preferences. We 


(n\ 

proceed to investigate the distribution of d in this universe of 2' 2 ' different 
members. The method consists of proceeding from the distribution for n to 
that for (n + 1). 

For n — 3 there arc eight configurations, of which two give one circular 
triad and six no circular triads. Consider the effect of adding a new vertex D 
to the vertices ABC. Four cases arise: 


(1) D-» all A, B , C. 

(2) two of A, B, C. 

(3) D-> one of A, B , C. 

(4) D-* none of A, B, C. 

The last two are symmetrical with the first two and need not be separately 
considered. 

Situation (1) arises in one way and clearly does not add any new circular 
triads other than those already existing in’the configuration ABC, It therefore 
contributes six values d = 0 and two values d= 1. So does situation (4), 

Situation (2) arises in three ways, according as D<~A, B, or C. The con¬ 
figurations so reached are similar and we may take any one, say D-e-C 1 , as the 
single preference. If A+-C then DAG is not circular and if B*~C the DBG is 
not circular. On the other hand A-+C and B-+C will each produce a circular 
triad. We then have the cases 



No. of circular . 
triplets added 

A^C-rB 

0 

A-+C-+B 

l 

A+-C^B 

1 

A-+C*-B 

2 
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We now consider AB. In the first two eases just enumerated the direction 
of AB does not matter and no circular triads are added. With the third A~>B 
gives no circular triad but A+-B adds one. With the fourth A->B adds one 
and A<-B adds none. 

Thus the number of circular triads occurring for these four cases is found 
to be 


No, of oiroular 
triplets 

Frequency 

0 

2 

1 

2 

2 

4 


We must multiply the frequency by three and by two to allow for similar 
symmetrical arrangements, and the final results are 


No. of circular 
triplets 

Frequency 

0 

24 

1 

16 

2 

24 

Total 

64 


The principles of this method are clear enough and the work may be 
formalized by a number of conventions which we omit to save space. In common 
with many similar combinatorial problems, however, troubles arise from the 
sheer number of possibilities and the difficulty of ensuring that nothing is 
overlooked. Up to the present we have found the distribution of d for n up 
to and including 7. The frequencies and probabilities are given in Table II. 

12, For the values already obtained the moments are given by the following 
formulae: 

K (about 0) = .(3) 



We have very little doubt that these results are true in general but can 
offer no rigorous proof. In so far, however, as the moments are in a sense 
symmetric sums it appears highly probable that they are given by polynomials 








in n\ and if this is so the values obtained are sufficient to establish polynomials 
of degree six or less. 

It is also to be noted that from the above values of the moments 

A = /4//4~ 8 K • /? 2 = /V/4~3 + 12 / w , 

from which it appears that a Type III distribution would fit the d-distribution 
fairly closely for moderate or large values of n. But as the distribution of d is 
of interest mainly for low values of n, which are all that occur in practice, it 
hardly seems worth while attempting to fit a curve. 


Agreement among several observers 

13. We now consider the investigation of similarities of judgments for 
m observers. Suppose that in a table of the form of Table I we enter a unit in 
the cell in row X and column Y whenever X-> Y and count the units in each 
cell. A cell may then contain any number from 0 to m. If the observers are in 


complete agreement there will be cells containing the number m, the 

remaining Q j cells being zero. The agreement may be complete even if there are 
inconsistences present. 

Biometrika xxxi 21 
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Suppose that the cell in row X and column Y contains the number y. Let 




the summation extending over the n(n - 1) cells of the table (the diagonal cells 
being ignored). Z is then the sum of the number of agreements between pairs 
of judges. Put or 

i- . {8) 

m\ ln\ 

UJW 


The maximum number of agreements, occurring if cells each contain m, 

11 J and thus in the case of complete agreement, and only in this case, 

The further we go from this case, as measured by agreements between 
pairs of observers, the smaller u becomes. The minimum number of agreements 
occurs when each cell contains m/2 if m is even or (m + l)/2 if m is odd. That 
is, if m is even, the minimum number of agreements is 


2 \2/ (a) 


and in this case 


When to is odd the minimum value of u is found to be 


14. We propose to call u the coefficient of agreement, It is unity if and only 
if there is complete agreement in the comparisons. Its minimum value is not - 1 
except when m = 2. This, however, is to be expected in a measure of agreement 
for there can be no such thing as complete disagreement among three or more 
observers in paired comparisons. If observer P differs in certain comparisons 
from observers Q and R, the two latter must agree on those comparisons. 

When m = 2, u reduces to 



and Z becomes twice the number of cases in which the two observers agree 
about a comparison, u is thus a generalization of a coefficient r proposed by 
Kendall (1938) to measure the correlation between two rankings. For general 
m, if the entries in the table were constrained to the ranking type, u would be 
the average interoorrelation r between observers taken two at a time. 
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15. In. discussing the significance of u it is desirable to know whether the 
set of preferences which give rise to it could have arisen by chance if the 
preferences had been assigned at random with respect to the quality under 
consideration. The procedure which first suggests itself is a generalization of 
the method used for the case of m rankings (Kendall & Babington Smith, 1939). 
That is to say, we sum the entries in the rows of the table and consider the 
variance of these entries. If the preferences are allotted at random we expect 
to find about equal numbers given to each object, and the variance will be low; 
in other cases it will be higher. 

The difficulty about this suggestion is that it has not been found possible 

to ascertain the distribution of the variance in the possible sets of pre¬ 
ferences . The case m = 1, corresponding to the distribution of d for inconsistences, 
is difficult enough to solve. For higher values of m we have failed to find any 
distributions except in trivial cases. 

We can, however, offer a test based on the distribution of u (or 27). The 
comparative simplicity of the distributions in this case is in accordance with the 
remark made by Kendall in the paper under reference that the distribution 
of r is much simpler and much more regular than the distribution of the 
Spearman correlation coefficient p. 

16. Consider one cell in the table in row X and column Y and let it contain 
the number y. Then the corresponding cell in row Y and column X will contain 

ra-y. Thus these two contribute to 27 the amount + 

Now, of the total ways in which the units can be distributed in the first cell 
j in which y units occur. Consequently the distribution of 27 
in the cell and the corresponding cell is given by the expression 




+ 




( m-r 


Ml) 


+ ...+*■ 


flD 


( 12 ) 


and since the distribution in other pairs of cells is independent if the preferences 
are allotted at random the distribution of 27 for the whole table is given by 


Z>(27) =f N , .(13) 

where N = 

17. The distributions have been worked out for the following values of m 
and n : m = 3, n = 2 to 8; m = 4, n = 2 to 6; m = 5, n = 2 to 5; m = 6, n = 2 to 4. 
Tables III to VI give the probabilities based on these distributions, i.e. the 
probabilities that a given value of 27 will be attained or exceeded. 

For constant n the distribution tends to the Type III form as m tends to 
infinity. In fact, for a single pair of related cells the variate value corresponding 

22-2 






TABLE III 

The probability P that a value of 2 mil be attained or exceeded, for m = 3, n = 2 to 8 



TABLE IV 


The probability P that a value of 2 will be attained or exceeded, for m = 4 and n = 2 to 6 
(/or n = 0 only values beyond the 1 % point are yiven) 


















TABLE V 

The probability P that a value of E will be attained on exceeded, 
for m = 5 and n = 2 to 5 



TABLE VI 


The probability P that a value of E will be attained or exceeded, 
for m = 6 and n-^toi 
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to a frequency is + which is a quadratic in y. Were the 

variate value a linear function of y the distribution for the single cell would 
tend to normality in accordance with the well-known property of the binomial. 
The case of the quadratic value corresponds to a transformation of the variate 
of the type x 2 = y and the transform of the normal form exp (-a: 8 ) dx becomes 
the Type III form exp(-y) y~ l dy. Since the N cells are independent and the 
sum of variates in the same Type III form is also distributed in that form, it 

follows that X 1 is in the limit distributed as exp (—£) if 2 dS except perhaps 
for some constants. Thus X’ or some multiple of it is distributed as y 2 . 

For constant m the distribution tends to normality with increasing n, 

18. The first of these results suggests that the Type III distribution will 
provide an approximation to the distribution (13) when m is moderately large. 
We proceed to find the first four moments of (13). 

It is sufficient to find the first four moments of (12), those of (13) being 
obtainable therefrom in virtue of the relationships which connect seminvariants 
of independent distributions. 

The rth moment of (12) about the origin is given by 


’% ■■ 


4)V 


mi 


since 2 m is the total frequency. Thus we have 


...(H) 


. “•> 

( ?7l\ 

r Jr p can be obtained by operating on the binomial (1 +a:] 


p times by x e.g. we find 




m 


W M-2«2, 


to , 1 /to 


S . r* = 2»£ + 


2 1 212 


and hence, substituting in (15), 


, l/m\ 

Thus the mean of the distribution (13) is given by 

/*;=<)• 


( 10 ) 
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In a similar way we find 

H = ’ .( 17 ) 

/ta=p(g), .(18) 



3m 2 - 15m +17 
' 8 


+^ s N{m i -m) 


( 10 ) 


These are the moments of E. Those of u are obtained by dividing by an 

( wA 

9 I and it may be noted in particular that the mean 

of u is zero. 

We have directly from (17), (18) and (19) 

_ 8 (m-2) 2 
Nm(m- 1 )’ 

A “ Hfibiy H “ 16m+17 +T TO(m - ‘I 

For constant m, as N -> 00 , 

/? 2 ->3 

and for constant N, as m-*oo, 

8 12 


confirming the tendency towards the Type III distribution. 
19. The first four moments of the Type III distribution 

dF = ke~ px x q ~ l dx 


are 


p p v p z> p* 


Equating 
we find 


the second and third moments to those given by (17) and (18) 


Nm(m- 1 ) 
2(ro-2) r ’ 


( 20 ) 


2 


P = 


m —2’ 


( 21 ) 


To make the first moments correspond we move the origin of the X dis- 
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( ?7l\ TYb _3 

—— to the right. We tints reach the approximation 
Z J 7/1 — Z 

to the £ distribution, coinciding in the first three moments 

dF-ke dx, 


where 


a: = 27- 



m — 3 
to -2 


or, transforming to the more usual y 2 form by putting y 2 
that 


tr-iff 
l “ 




is distributed as y 2 with 


Nm(m- 1) 
( to - 2) 2 


4;t:/(m - 2), we find 

.( 22 ) 


( 22 ) 


degrees of freedom. 

The fourth moments of 27 and the y 2 approximation differ by terms of order 
N~ l and to - 1 compared with their absolute values. 

20, It only remains to be seen how largo to and n must be for this to provide 
a satisfactory approximation. 

Consider first the distributions for to = 3. When n - 8, N = 28, we have, for 
the approximation, 427 distributed with 168 degrees of freedom, From Table III 
we see that for 27 = 54, P = 0-011 and for 27=58, P = 0-0011, Applying a 
continuity correction by deducting unity from 27 we find for the y 2 approximation 
with y 2 = 4x53, v=168, P = 0-011, and with y 2 = 4x57, P = 0-00114. The 
correspondence is very close, in spite of the low value of to. 

For to = 4, n = 5, N = 10, the approximation gives 227-30 distributed with 
30 degrees of freedom. For E = 40 and 41, this gives, with continuity corrections 
of 0-5, half the variate-interval, y 2 = 49 and 51, v= 30. From the diagram 
given in Yule & Kendall’s “Introduction to the Theory of Statistics” (1937) 
it is seen that these values lie one on either side of the 1 % value; and this is 
in accordance with the exact values of P, which are seen from Table IV to 
be 0-016 and 0-0088. Similarly we find that the values of 27, 37 and 38, lie on 
either side of the 5 % level, which is again in accordance with the exaot values, 
P = 0-060 and 0-038. 

For to = 6, n - 4, N = 6, the approximation gives 27-33-75 distributed with 
11-25 degrees of freedom. For 27 = 59 and 60 the corresponding y 2 values are 
seen to lie on either side of the 1 °/ 0 point, which accords with the exact value 
of Table VI. 

We conclude that the y 2 approximation provides an adequate test of 
significance for the values of to and n outside the range for which Tables III-VI 
give exact values. 
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21. As a matter of theoretical interest we may record the results for the 
distribution of u when the data are ranked. It appears that in this case 

12 2n + 5 1 /m\/n\ j 1 (2n + 5) 2 f 

m-22n s +6n + 7i 2 ' 2\2/\2 jj 1 3(m - 2) 2n 2 + 6n + 7j_ 
is distributed approximately as y 2 with 

(“) fa=a?(t) (Xto+7) degree8 ot freetlom ' 

This result is not of much practical value. The case of m rankings can be 
more simply treated by other methods. 

Interpretation op results op paired comparisons 

22. In the light of the foregoing theory we may discuss the interpretation 
of the results of a paired comparison experiment. 

If for each observer the coefficient of consistence is unity the comparisons 
reduce to rankings and may be discussed by known methods. But if some 
or all the coefficients are not unity we have to consider the following 
possibilities: 

(а) Some of the observers may be bad judges and the inconsistences reflect 
their shortcomings in making comparisons. 

(б) Some of the objects may differ by amounts which fall below the threshold 
of distinguishability for some observers. 

(c) The property under judgment may not be a linear variate at all and 
we may be getting the sort of confusion which would result if observers were 
asked to compare English towns according to the bivariate concept “ geographical 
position”. 

(d) Several of the effects may be operating simultaneously. 

23. If we have only one observer and have no prior knowledge of his 
capabilities it is not in general possible to apportion his inconsistences among 
these causes. Exceptions may occur when the inconsistences are of a marked 
and peculiar kind; for instance if they involve only four objects out of 15, we 
may suspect that the four are practically indistinguishable rather than that the 
observer is unable to make distinctions at all and avoided inconsistences among 
the others by sheer chance. But even here conclusions drawn a posteriori after 
inspection of the data are dubious. Table II gives a test of the hypothesis that 
an observer is incapable of making judgments. Eor example, with n = 7, the 
chances are 983 in 1000 that if the preferences are made at random there will 
be more than two inconsistent triads, so that if we find two or less, it is 
improbable that the observer is completely incapable of judgment. We might 
then be led to suppose that his small deviation from internal consistence is 
due to fluctuation of attention, very close resemblance to the objects giving 
rise to the inconsistences, or both. 
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24. With m observers the investigation can be taken a good deal further. 
If all the observers show inconsistences we suspect that the objects are at fault 
or that the observers are being asked to perform an impossible task. On the 
other hand, if most of the observers show a small or zero inconsistence we 
suspect that the others are just bad judges and may reject their data 
accordingly. 

As between indistinguishability of objects and non-linearity of variate, a 
choice of explanations would depend largely on the extent to which incon¬ 
sistences were concerned with the same set of objects. If there is a high value 
of u t indicating concordance of judgment, we expect to find most of the 
inconsistences confined to certain objects, and common to observers. In this 
case we suspect that the objects are close together in the degree to which they 
exhibit the quality under consideration. .But if the observers scatter their 
inconsistences over the whole field u will be moderate or low and we suspect 
that the observers are being asked to do something beyond their capacity; and 
this brings us to question the validity of regarding the quality as a linear variable. 

25. When a quality such as “bravery” or “intelligence” is insusceptible 
to measurement there is frequently doubt of this kind. But this lias not 
deterred investigators from assuming that such statistical variables exist, or 
from requiring observers to rank objects according to them, or in some cases 
from replacing such rankings by quantiles of the normal curve. We are never 
tired of criticizing this Principle of the Hypostasis of Plausible Terminology. 
Hitherto it has flourished largely because of the difficulty of adducing evidence 
against it; and we hope that the inconsistence of paired comparisons will provide 
a criterion, however rough, of the legitimacy of the methods to which it leads. 

But we would emphasize that our approach to the method of paired com¬ 
parisons has a somewhat different object from that elaborated by Thurstone 
(1927 and many subsequent papers). As we understand it, his method is appro¬ 
priate where one is entitled to assume a priori or by reason of precautions taken 
in the selection of material that a linear variable is involved and that there exist 
perceptible differences between the items presented for comparison. Our object 
is to make it possible to dispense with such assumptions and precautions. 

26. A few words may be added about the case in which an objective order 
is known to exist (as, for instance, in judging individuals according to age or 
weight). In such circumstances the appearance of inconsistences will indicate 
unreliability of the part of the observer or subliminal differences between 
objects. A measure of the observer’s reliability may be obtained by calculating 
u between known and observed comparisons. If £ is high enough to enable us 
to accept his judgments as internally consistent on the whole, u may still be 
low enough to reject his judgments as accurate. 

27. We conclude with an example of the application of the foregoing theory 
to some experimental material. 
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Classes of children (ages U to 13 inclusive) were asked to state their 
preferences with respect to certain school subjects. Each child was given a 
sheet on which were written the possible pairs of subjects and asked to underline 
the one preferred in each case. Two classes gave the following results: 

(a) 21 boys, 13 school subjects. The preferences are shown in Table VII, 
which is in the form described in section, 13; e.g. there were 18 boys who 
preferred Art to Religion, 

TABLE VII 

Preferences of 21 hoys in 13 subjects 

I 2 3 4 5 6 7 8 9 10 II 12 13 Total 

1. Woodwork 

2. Gymnastics 

3. Art 

4. Science 

5. History 

6. Geography 

7. Arithmetic 

8. Religion 

9. English Literature 

10. Commercial subjects 

11. Algebra 

12. English Grammar 

13. Geometry 


The calculation of £ for this table, in which the objects are arranged in 
order of total number of preferences, may be shortened by noting that £ as 
given by equation (7) may be transformed into the form 

where the summation now takes place over the half of the table below the 
diagonal. Since the numbers in this half are smaller than those in the other 
half there is a considerable saving in arithmetic. 

We find £ = 9718 



14 

20 

15 

15 

16 

16 

18 

18 

18 

20 

21 

20 

211 

7 

— 

14 

12 

13 

18 

14 

16 

16 

20 

16 

18 

19 

183 

i 

7 

— 

10 

14 

10 

16 

18 

16 

16 

17 

16 

19 

160 

6 

9 

11 

— 

11 

12 

15 

14 

13 

13 

17 

17 

16 

154 
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10 
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14 

11 

12 

14 

15 

13 

14 

16 
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5 

3 

11 

9 

7 
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14 

14 

13 

13 

16 

15 

17 
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i5 

7 

5 

0 

10 

7 

— 

9 

11 

13 

15 

13 

15 
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3 

5 

3 

7 

9 

7 

12 

-- 

12 

14 

14 

16 

14 

116 

3 

5 

5 

8 

7 

8 

10 

9 

— 

10 

13 

13 

15 
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3 

1 

5 

8 

a 

8 

8 

7 

11 

— 

10 

10 

14 

91 

1 

5 

4 

4 

8 

5 

6 

7 

8 

11 

— 

10 

13 

82 

0 

3 

5 

4 

7 

6 

8 

5 

8 

11 

11 

— 

13 

81 

1 

2 

2 

5 

5 

4 

6 

7 

6 

7 

8 

8 

— 

61 

j Total 

1638 


and hence 


2x9718 


There is thus a certain amount of agreement among the children, indicated 
by the positive value of u. Is this significant? 

We note first of all that this distribution of preferences could not have 
arisen by chance to any acceptable degree of probability. In fact, y 2 = 412-4 
(equation (22)) and v = 90-7. The large value of v justifies the use of the normal 
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approximation to the x 1 2 3 4 * 6 7 8 9 10 11 distribution and we find f(2x 2 )-f{2p~ 1) = 16-3 a very 
improbable result on the hypothesis of a random allocation of preferences. 

The distribution of circular triads was as follows: 


No. of triads 

Frequency 

No, of triads 

Frequency 

0 

l 

12 

1 

l 

1 

17 

3 

4 

6 

21 

l 

(i 

2 

25 

i 

7 

2 

29 

I 

8 

] 

39 

1 

10 

1 

Total 

21 


The total number of circular triads was 242 with a mean of 11*5. Only 
one boy was entirely consistent, On the other hand, for n — 13 the maximum 
number of circular triads is 91, with a mathematical expectation of 71-5. It is 
thus clear that, except perhaps for one boy, we cannot suppose that any boy 
allotted preferences at random. We are again led to conclude that the boys 
are genuinely capable of making distinctions, and that consistently on the 
whole. Half the boys have coefficients of consistence £ greater than 0-92. 

We conclude that the boys can make preferences and that in their view the 
subjects are sufficiently different to enable a reasonably consistent set of 
preferences to be made. So far as these data are concerned we would see no 
objection to the assumption that a scale of preferences can be set up. With 
this in mind we can say that the value of u indicates a certain amount of 
agreement, though not a strong one, between the boys as to which subjects 
they prefer. 

(b) 25 girls, 11 school subjects. Table VIII shows the data. 

TABLE VIII 

Preferences of 25 girls in 11 subjects 

1.2 3 4 5 6 7 8 » 10 11 Total 


1. Gymnastics 

2. Soionoo 

3. Art 

4. Domestic Science 

6. History 

6. Arithmetic 

7. Geography 

8. English Literature. 

9. Religion 

10. Algebra 

11. English Grammar 


_ 

10 

19 

17 

20 

17 

21 

21 

21 

18 

22 

180 

15 

— 

12 

15 

17 

15 

21 

10 

18 

10 

17 

105 

6 

13 

— 

10 

16 

18 

10 

17 

10 

1!) 

10 

147 

8 

10 

9 

— 

16 

11 

13 

15 

14 

11 

1.4 

121 

5 

8 

9 

9 

— 

14 

18 

12 

13 

15 

18 

121 

8 

10 

7 

14 

11 

— 

12 

13 

12 

16 

18 

121 

4 

4 

16 

12 

7 

13 


14 

15 

14 

14 

112 

4 

6 

8 

10 

13 

12 

11 


14 

13 

14 

105 

4 

7 

9 

11 

12 

13 

10 

u 

— 

11 

17 

105 

7 

9 

6 

14 

10 

9 

11 

12 

14 

— 

12 

104 

3 

8 

9 

11 

7 

7 

11 

11 

8 

13 

— 

88 

Total 

1375 
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We find £ = 8928, u = 0-082. 

For the y 2 significance test, f = 180-3, v = 62-4 and ^l{2f)-^j(2v~l) = 7-9, 
as before a very significant result. 

The distribution of circular triads was 


No. of triads 

Frequency 

No, of triads 

Frequency 

1 

2 

1,7 

1 

2 

2 

19 

1 

3 

1 

22 

1 

4 

1 

23 

2 

6 

1 

27 

1 

8 

1 

■ 32 

1 

9 

1 

35 

1 

11 

2 

37 

1 

12 

2 

38 

1 

13 

1 


— 

14 

1 

Total 

25 


The total number of circular triads is 382 with a mean 15-28. For n = 11 
the maximum number of circular triads is 55 with an expectation of 41-25, 
Several of the girls come very close to this, the worst having a coefficient of 
consistence equal to 0-31. 

We are, however, again led to conclude that the preferences were not 
allotted at random and that most of the girls are capable of exercising a judgment 
which is on the whole consistent. There is only a very slight agreement in 
preferences. 

Thus the girls are less consistent and less alike in preferences than the boys. 
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THE MEAN AND VARIANCE OF x 2 , WHEN USED AS 
A TEST OF HOMOGENEITY, WHEN EXPECTATIONS 

ARE SMALL 


By J. B. S. HALDANE, F.R.S. 


Pearson’s measure of divergence, X 2 , can be used not only as a test of goodness 
of fit, but as a test of homogeneity when the expectations arc unknown before 
the samples are observed. Consider the (m x n )-fold table: 


«U 


a w 

' 11 tt lHi 

h 

tt 21 

a n 

a, i'i 

... a im 

S 2 

®31 


tt 33 

••• &3 m 

S 3 




a n 

a n3 

• ’ • 11 nm 

K 

k 

k 

h 

ki 

N 


Here each a tj represents a number of individuals observed, and 

j 

tj = E a i}) N = = S V The table may represent n samples of a 1 , s % . s n 

i i j 

individuals, each sample falling into m classes, t v £ a ,.... l m being the grand totals 
in each class. Or it may represent m samples of t v t 2> ■.., t m individuals, each falling 
into n classes, the class totals being s x , s 2 ,s n . In each case we ask what is the 
probability of so bad a fit if every sample is taken from the same large population. 


The expected value of is clearly —/•, and 



NY 


Fisher (1022) showed that when every s i and tj was sufficiently large, X 2 has 
the usual distribution, with (m- 1) [n-1) degrees of freedom. Thus its mean is 
(m-1) (n-1), its variance 2(m— 1) (rr— 1). Haldane (1937, 1938,1939) investi¬ 
gated the exact values of the moments of y a when expectations are small, in 
(m x w)-fold tables with nm or m(n~ 1) degrees of freedom, but did not attempt 
the present problem. This had previously been done by Cochran (1936), in the 
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restricted case when m = 2, and every s. £ is equal. Cochran used a method based 
on the use of characteristic functions. My own method is entirely elementary, and 
completely exact, though rather tedious. My results are very close to Cochran’s, 
but the accurate values are worth putting on record. 


The mean value of % 2 

Given the marginal totals, the probability of any particular distribution is 

rw-)n(M) 

p= ~¥Tnfeip 

ij 

And this is so whatever may be the true expectations. Thus, if in the sample s { 
the expectation of the observation is s { pp so that that of tj is Npj, then the 
probability of the given distribution is 

nipflnw) 

~nM“ ' 

ii 

But the probability of the given marginal totals ij is 

W 

j 

Hence P is the probability of obtaining any particular distribution with the given 
sample sizes s { and class totals 

If E(x) denotes the sampling expectation of®, then $(%) = Z Pay, summation 
being taken over all samples possible with the given marginal totals, Hence 

E[a ' ij) (N - 1 )! (ctjj - 1 )! Tli^kJr ’ 

Id 

where Jc assumes all positive integral values between 0 and n except i, and l all 
positive integral values between 0 and m except j. That is to say the quantity 
summed is the probability of a set of observations similar to those of the table 
except that s t and t 3 - have been diminished by unity. Hence the sum equals unity. 


W r n(n n1 

■U'lPyvhj 1 )J ~ N{N- 1) ’ 

Win n \ — S i Sk ^i~ 1 ) 

N{N-l) ’ 




s i s k^jh 

N(N-iy 


Similarly 
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and so on. Hence 


The mean and variance of y 2 




Nfcm-N 

Lii (MjU 

N^-rFAa^l-N 
ij L'V'j J 

«L J 




= —-j (N z -nN ~mN + ran/V) -. 
(to- 1) (w- l)iV 

- jv-T ’ 


a result previously obtained by Bartlett (1937), 

This clearly becomes (m-l)(n - l) when N tends to infinity. In the case 
where n = 2, and every s ; = s, we have 


(n- l)ws 
ns- 1 


Cochran gave 


X 2 = n -1 + 


ns ~ s +1 


This exceeds (2) by -4-——. and is therefore correct to O^- 1 ). It will be noted 
ns z {ns- 1) 

that the mean depends only on the number of degrees of freedom and the grand 
total N. The higher moments involve the marginal totals s i and t jy and are 
therefore more complicated. It is clear that the expectation is diminished because, 
in sampling from a finite group of N individuals, E(a tj ) has the same value as in 
sampling from an infinite population with the same frequency, but E(afy) is less. 


This variance in a (2 x w)-eoli) table with equal samples 

Owing to the complexity of the general expression for the variance, we shall 
first calculate it for the (2 x n)-fold table: 


% 

% 


■ K 

A 

h 

b* 

6 3 . 

■ b n 

B 

s 

s 

s 

. 8 

_ 
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Here u samples of s are taken, and each falls into two classes, say a { healthy 
and b £ diseased. The totals are A and B, and N = us = A + B, It will be con¬ 
venient to put A = pN, B = qN. 


X 



2 = — S (sp - a ) 2 

m 

spq q 


1 


s 2 pY^ a ^' 


Now 


(M 2 = S ot + 2B«?4 

i tj 


summation being taken over all unequal pairs of values of i and j. 

Hence 

E{Sa\f = nE{a\) + n(n- 1) E(a\a]) 

= -1) K: - 2) (a* - 3) + 6a, .(a,. -1) (a { - 2) + 7a>. £ -1) + a t ] 

An{n-l) E\a i {a i -1) a.(a^ -1) + 2-1)a,- + a { a.] 

= -1 )(«-*)'(«- *)+n(» -1) « 2 (* -1 ) 2 ] 


+ W- 1 1)1f 2) («-2) + 2n(*-1)^-1)] 

+ 4( F ~?) [ 7ws (s - 1 )+»(» -1) « 2 3+^ 

Since N — ns, A = pus, we have 


JL 

pns 


E(£aj) 2 


(A-l)(A-2)(A-3) 
(N — 1) (N—2) {N~ 3) '* 


l)[u« 2 ~-(u + 4)s + 6] 


2(4-1) (4-2) 
+ (JV— 1) (N — 2) 


(s-l)[(u + 2)s-6] 


+ jg —j [(u + 6) s — 7] + 1> 


J^) 2 = 


pu 2 s 3 


■ 1) (us-2) (us- 3) 


[p 3 u 3 s 3 (s -1) {us 2 ~ (u + 4) s + 6} 


+ 2p 2 u(u -1) s(s -1) (us — 6) 

+ p(u — 1) {u(u +1) s 2 — 2(3u - 4) s - 2} — 2(u -1) (s -1)]. 


Biometrika xxxi 


13 
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So the variance of y 2 is given by 


W = 


2\2 


\n-l)ns pns 
(ns- 1) q . 


E(Sa^ 

P 2 q 2 s 2 

E(Za\f n 2 s 2 [pn(s-l) + n-lf 
p 2 q 2 s 2 


q 2 (ns- 1) 2 


n 2 s 


pq 2 (ns -1 ) 2 (ns - 2) (ns - 3) 

+ 2p 2 n(n -1) s(s -1) (ns - 6) 


[(ns - l){p 3 n 2 s(s-1) (ns 2 -n-4s+ 6) 


+p(n~ 1) (nn+ Is 2 -fin- 8s- 2)- 2(n- 1) (s- 1)} 
-ps(ns - 2) (ns - 3) {pn(s -1) + »-1} 2 ] 

2n 2 (n- l)s(s- 1) 


pq 2 (ns -1) 2 (ns - 2) (ns - 3) 


[p 2 n 2 s 2 - 2 p 2 n 2 s 2 + p(n 2 s 2 + ns -1) - (ns -1)] 


2n 2 (n -1]is(s -1) )(pqn 2 s 2 - ns + 1) 
pq(ns ~ 1 ) 2 (ns - 2) (ns - 3) 


2n 4 (n- l)s 3 (s-1) / ns~l\ 

(ns-l) 2 (ns-2)(ns-3)\ AB J 

2(n-\)N*(N-n) ( N-l\ 
(N-l) 2 (N-2)(N-d)\ 1 AB)' 


This becomes 2(n-1) when A and B tend to infinity. 

When A and B are both of order N, that is to say neither p nor q is small, 
it becomes 


2(n-1) i-(n+.i- 7 )w-i 


+0(N~ 2 ). 


Cochran’s expression is 2(n-1) (l-niW^ + O^- 2 ), which is accurate when 
J21 . 

P ~ i ± i ne. 0-8273 or 0T727, It will be seen that the variance is diminished 

when n > 7 - ^, which is certainly the case if n > 3. If A remains finite when N 
and B tend to infinity we find y 2 = n- 1, 


F(x 2 ) = 2(n-l)(l-I), 


a formula already given by Haldane (1937). 

Formula (3) may be compared with the values given by Haldane (1937) for 
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they X arc" f ° ld ^ ^ * degre6S offreedom - In tlie terminology of this paper 

X s = *, 

s m 1 

Similarly the limiting case given above may be compared with the values for 
ti ft'■fold table with n degrees of freedom, namely, 

j?=»> 

I / (X 2 ) = 2n + ~. 


The variance for a (to x?i)-eold table 

1 his of course includes the variance found above as a special case. However, 
as the summations involved are somewhat hard to follow, the simpler algebra 
for the special ease has been given separately. 



Here summation over the values of i, k on the one hand, and j, l on the other, are 

independent and commutative. 2K*) = 2lF2( a: )l, where k assumes all values 

ilc i Lit J 

from o to n inclusive, except i. 210*0 has a similar meaning. 

ji 

Thus 21(1) - n, 21(5,.) « N, Sp + £ (1)J = « 2 , + S s / ( J = nN, and so on. 

Let S = 21st 1 , T = 21<7 1 . 

i i 

Then 

a % * a ij( a ij - 1 ) (»« - 2 ) (%■ - 3) + fayfaj - 1) Ki - 2) + 70y(0y - I) + *«, 

= a ij{ a i]~l) a kj( a kj~ ^)~k a ij{ a ij~^) a kj + a ij a kj{ a kj~' l) + a # a W> 

and so on. The expectations of the two middle terms in the last expression are of 
course equal. Remembering that 


m a t a \\( a o)( a on s i( s i~ l)( 5 i - 2)K-3)t,(iy —l)(t,-2)(^-3) 
M%( “« -1) (»« - 2) (»« - 3)] =-V(jV-I)(<V-2)(iV-3)-' 


and ^[%(%-l)%K,-l)] 


s i( s i ~ 1) 8 k( s k ~~ 1) ~ 1) (tj ~ 2 ) m ~ 3) 

N{N-l)(N-2){N-Z) 


23-2 



352 


The mean and variance of y 2 


and so on, we have 
N^EKxU-Nf] 


( s i~ 1) i s i~ 2) (*?{—! 


-2)(t # -3) 


N(N— 1) (N~2) (N- 3)[_y 


2HSi ~ 3 )( ^ - - 1 - )( ^ 1 ) + S (*,-1) (.5 fc ~ 1)(^-X) (t,~ 1) 

ijl s i ikjl 

, 1 r« v ( 5 *"' 1) ( s i “ 2) [tj — 1) (^ — 2) 

^~1)(^-2)L 0 ^ Mi 

+ 2s (gizMrilfcl) +2E fo- 1 )^- 2 )^- 1 ) 


+ 2 £(* 


t -l)(fi~l) 


1 r + £*i-i + Ijfci + s (l) 

•* XJL ij Mi ifej ij ijl S 1 ikjl 


] 4 ?( 4 )- 


Hence 




" (iV- 1 )^- 2 )(^^ 3 ) 6s ' J+l1 6 s r l ) + S(Si l)(«fc-l) 

X 6^ + 11- 0^-1) + 2 (tj -1) (*,-1) 

Li ji 

"*■ i) (jy_2) 2 S( s i~3 + 26' i x ) - 3 + 2^ _1 ) 


+ js(«i - 3 + 2«r Jl ) + S(3f-1)1 (s(«i - 3 + 2 tj-i) + S(L - 1 )) 

1 ik ) \ j ji )_ 

+ tf~T [ 6 S(1 - a*- 1 ) 2(1 - if 1 ) + (s (1 - M 1 ) + £ (1)} 

xfsa-t^+sojn+is^s^- 1 ) 

u n )j iv ( j 

= (F21)^-2) (i\r_3)^ 2 ~ 2 ( n + 2)iV r +n 2 +10n-6»S] 
x[y 2 -2(m + 2)y + m 2 +10m-6r] + ~ 2 

x [2(iX- 3» + 2S) (tf- 3m + 2T) + {nN-n i -2n + 28) (mN - m 2 - 2m + 2T)] 
+ iV^Ti f 6 ( n " ^) ( w - T ) + ( wa - ^) (^ 2 - 
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N-'{N-l)(N-2)(N-3) E [( X 2 + iV) 2 ] 

= [N z -2(n + 2)N + n 2 + lOw- 6$] [N 2 — 2(m + 2)N + m*+ 10m-6T] 

+ 2{N- 3) [( nm + 2)N 2 

- {nm{n + m) + 4«,m + 6(w + m) - 2(n + 2)T-2(m + 2) S}N 
+ n z m z + 2 nm(n + m) + 22 nm - 2(n* + 8 n)T- 2 (m 2 + 8m) £ +12 ST] 

+ {N-2){N~ 3) |> 2 m 2 + 6 nm - (n z + 6 n)T- (m 2 + 6m) 8 + 1ST] 

+ {N-l)(N-2)(N-Z)ST 
= N i +2(a-/l — 2)N a + [(a,-fi) z -6ct + 6/] + 4 c ]N z 
+ (- 3a 2 + 8a/? - 4/S 2 + 6a - 4/?) + a 2 - 2a/?+4a - [(% 2 + 2n~2)T 
+ (m 2 + 2m - 2) /S] N z + [(n z -2n)T + (m z - 2m) S] N + N Z {N +1 ) ST, 
where a = wm, /? = w+m. 

But ^( x 2 +iy) = ^ ~ 1 ^~ 1)y +jy 


Hence 


N(N + a-fi) 
N -1 


-1) 2 (i\? - 2) (N - 3) F(x 2 ) 

- (N-l)[N i + 2(<x-P-2)N 3 +etc.]-N(N-2)(N-Z){N+u-/3) z 
= 2(a-fi + l)N 3 + (a. z + 2p-4:)N i -2(a 2 -ct.fi+ / 8 z + a-2i3)N 
— a(a - 2/? + 4) - [(n z + 2n - 2) T + (m 2 + 2m- 2) $] N Z (N - 1) 

+ [{n z -2n)T+ (m 2 - 2m) S]N(N-1) + STN z (N z -l). 

Thus 

N 

F ^ 2) = (iV — 1) (iV — 2) (N - 3) 


'2(«,-1) (m- l)jV 3 + (n 2 m 2 + 2«. + 2m-4) N z 

-2{nm(n-l)(m-l) + (n+m) [n + m-2)}N-nm(n-2) (m-2) 

x [- IfTI 

- {(n z + 2n - 2) T + (m 2 -f 2m - 2) 5} N z 

+ {n{n-2)T + m{m-2)S]N + STN z {N+l), . (4) 


where S and T have been defined on p. 351 above. 
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The mean and variance of x 2 

When there are n samples of s {) s 2 , s 3 , s n members, falling into two classes 

N 

whose totals are A and B , we have m = 2, T - j-g, S - 2 a)' 1 , so 


F(x 2 


iW 


(j\r-l) 2 (W~2)-(tf-3) 


2 ( 7 ?.— 1) iV 2 + 2n(2n + 1) iY - 6n z 


- 6 N(N-l)Es^+ N -V~ 1} { N(N +1) Zs ; 1 - (n 2 + 2 n - 2) N + n(n - 2)}^ 

.(5) 


When every s ; = s, this reduces to formula (3). In the case of the fourfold 


table 


F(X a ) 


a 

b 

c 

d 


, it becomes: 


Y 2 


(N -1) (Y- 2) (IV- 3) 
1 


"2(Y 2 + 10Y -12) 


- 6Y 2 < 




l 


Y-l 


+ . 


Y 3 (Y +1) 


(a + 6j(c+d) (a + c)(6 + d)j (o + 6)(ch- cf)(a+c)(Hd) 


. ...(b) 


Discussion 

The expressions for the higher moments would clearly be a great deal more 
complicated. The above calculations have, I think, a twofold interest, They show 
that the loss of one degree of freedom arises from the fact that we are sampling 
from a finite, and not an infinite, aggregate. Arid they point the way towards 
an exact treatment of the problem of curve fitting, for which Pearson originally 
designed the measure y 2 . In the case of a (nx 2)-fold table with (n — 1) degrees 
of freedom we are, in effect, asking whether the observations give a satisfactory 
fit to a horizontal straight line y = k, where ?/ is the observed frequency of a type 
within a sample. If we were trying to fit a line y ~ k +• lx, we should have n~~ 2 
degrees of freedom, x having a different value for each sample, If we were trying 
to fit a normal curve we should have n- 3 degrees, and so on. Jeffreys (1938) 
has pointed out the great difficulties of curve-fitting in such cases when expecta¬ 
tions are small. It is clear that the expected value of y 2 in such a case is not 
exactly n-3. It may turn out to be slightly greater. Thus an investigation of 
the actual law of error in a particular type of observation will demand an extension 
of the present investigation to cases where several more degrees of freedom are 
lost. 
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A NOTE ON THE STATISTICAL ANALYSIS OF SENTENCE- 
LENGTH AS A CRITERION OF LITERARY STYLE 


By C. B. WILLIAMS, So.D. 

Department of Entomology , Rothamsted Experimental Station 

Some years ago I made a number of calculations of the frequency distribution 
of words of different length in different books to see to what extent authors 
kept to a definite distribution and so perhaps might be identified by such a 
method. The results obtained, however, were not striking and the work was put 
at one side, 

Mr Udny Yule (1039), however, has attacked the problem of authorship 
from the angle of the variation in sentence length, and this appears to be a 
much more fertile method of approach. 

Mr Yule shows that the frequency distribution of sentence length (i.e. 
number of words between successive full stops) is of the skew type and by 
comparing in two different manuscripts, the mean, the median, quartiles and 
deciles he is able to produce convincing mathematical evidence on the identity 
or otherwise of their authorship. 

Mr Yule does not comment on the skew distribution further than to state 
(p. 371) “they are not of the Poisson type, but of the type in which the square 
of the standard deviation largely exceeds the mean”. 

When I converted some of Yule’s tables into diagrams I was struck by their 
general resemblance to certain skew distributions with which I have recently 
been dealing in some Entomological problems, and which distributions, I found, 
became normal and symmetrical if the logarithm of the number was taken as 
a basis for subdivision into groups instead of the number itself (see Williams, 
11)27). 

I was unable to test this transformation on Yule’s figures as he unfortunately 
does not give the original data, but only the word length of sentences in groups 
of five; so it was necessary to obtain some new data. 

These I obtained by counting the number of words in each of (100 sentences 
from the following three books; 

(1) G. K. Chesterton, A Short History of England, 1917. 

(2) H. G. Wells, The Work, Wealth and Happiness of Mankind. 

(3) G. Bernard Shaw, An Intelligent Woman’s Guide to Socialism. 

All three works deal with the exposition of somewhat similar sociological 
subjects and none of them are in the “conversational’’ style. 

The selection of the sentences was randomized as follows. Each of the books 
is divided up into chapters, sections or both. In Chesterton’s book the first 30 
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sentences were counted in each of the first 20 chapters. In Wells’s book the 
first 10 sentences were counted in each, chapter subdivision up to chapter vn, 
division 11. In Shaw’s book the first 15 sentences in each of sections 1-40 were 
taken. In each case the greater part of the book was covered. 

The original data thus obtained are shown diagrammatically in Tig. 1. 
Each of the distributions is of the typical skew type obtained by Yule: Shaw is 
the most extreme and varies from 3 words to 143; Wells is less skew and ranges 
from 3 to 91; while the Chesterton curve is the least skew and varies from 5 to 91 
with only two values over 60. 

Erom Table I it will be seen that the arithmetic mean number of words per 
sentence is 25-87 for Chesterton, 24-11 for Wells and 31-23 for Shaw. The 
medians are also different and presumably the quartiles and deciles, but these 
latter were not calculated. 


TABLE I 

Frequency constants of the distributions of sentence length 



Chesterton 

Wells 

Shaw 

Number of sentences 

Number of words 

Arithmetic mean no. of words 
Median no. of words 

Mean log no. of words 

Geometrical mean no. of words 
Standard deviation of mean log 
Standard error of mean log 

600 

15,621 

25-87 

25-3 

1-37 

23-5 

0-200 

0-0080 

600 

14,463 

24-11 

20-8 

1-31 

20-5 

0-237 

0-0095 

600 

18,735 

31-23 

26-0 

1-39 

24-5 

0-290 

0-0112 

_ 


If, however, instead of taking the frequency distribution of the actual 
number of words per sentence we take that of the logarithm of the number we 
get the distributions shown in Figs. 2-4. They undoubtedly show a very close 
resemblance to the "normal distribution”. The mean log and standard deviation 
for Chesterton is 1-37+ 0-20; for Wells 1-31 i0-24 and for Shaw 1-39+ 0-29. 
The standard error of the mean is, owing to the large number of observations, 
in all cases very small and approximately +0-01. 

On each of the three figures is superimposed a normal curve of the same 
area, mean and standard deviation and it will be seen how closely it fits the 
observed values. 

The following comments may, however, be made: 

(1) The greater irregularity of the observed values in the lower portion of 
the distribution is due to the irregular distribution of the logarithms of integers 
when grouped in small artificial divisions as in the present case. Thus there is no 
logarithm of an integer between 0-01 and 0-25; none between 0-61 and 0-65 
and between 0-71 and 0-75. On the other l]and, there are two between Ml 
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and 1-16; two between MB and 1-20; only one between 1-21 and 1-25 and three 
between 1-26 and 1-30. Thus in all three diagrams the frequency of 1-21—1*25 
is well down and that of 1-26-1-30 is far up, Some process of smoothing would 
undoubtedly eliminate these irregularities, but it was thought better to leave the 
data in their original form and draw attention to the sources of irregularity. 

(2) There appears to be on all three diagrams a slight shortage of high values 
and a slight excess of low ones. On the latter I have no comment but it appears 
possible that the small deficit in the longer sentences might easily be due to a 
biased effect introduced by the habit that many writers have of cutting up 
unusually long sentences into their component parts when reading over their 
manuscript or proofs. 

On the assumption that the normal curve is a sound representation of the 
frequency distribution of the log number of words in sentences, Tig, 6 has been 
prepared which shows the means and normal curves for the three books super¬ 
imposed on one another. The means are close together but the distributions are 
very different. 

The difference in means between Shaw and Wells is 0-09 and the standard 
error of the difference is only 0-015. Thus the difference is six times the standard 
error and hence certainly significant. Between Shaw and Chesterton the 
difference of the means is barely significant but that between the standard 
deviations is quite striking. 

If the above reasoning is correct, it is unnecessary for the comparison 
between two documents to compare arithmetic means, medians, quartiles and 
deciles, but only the log mean and the log standard deviation; all other com¬ 
parisons are included in these. 

It follows also that Mr Bernard Shaw, while undoubtedly under the im¬ 
pression that ire was punctuating at his own free will, was for this particular 
book hide-bound within the limits of 

r/ 1 r(l-4-m) 2 “| 

J 0-29 V(2ff) eXp L 2(0-29)* 

while similarly Mr Wells was writing under the restricting influence of 

1 ^ [(1-3-se)*- 

J 0-24 V(27t) GXP L 2(0-24)2 J ’ 

where Z is the frequency and x the logarithm of the number of words per 
sentence. 

It is also perhaps worthy of passing comment that the curve representing 
Mr Shaw is short and broad while that representing Mr Chesterton is tall and 
slender; which shows how necessary it is to use these curves only for the purpose 
for which they were originally designed. 

Perhaps something might be added on the meaning in words of the above 
mathematical transformation. If the log distribution is normal we can infer 
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that the extent to which a sentence in the process of writing is likely to vary is 
at any level proportional to the length of the sentence. Thus when he is thinking 
in short sentences of about 10 words an author is likely to vary say from 8 to 
12 words; when he is thinking in longer sentences of say 100 words he will vary 
from 80 to 120. In other words the variations are proportional or geometric 
and do not merely involve the addition or subtraction of x words at all levels. 
Further, if the geometric mean is taken as a basis, sentences between this and 
half its length are as frequent as those between it and twice its length; sentences 
down to one-quarter its length are as likely to occur as sentences up to four 
times its length; and so on. 

If the arithmetic mean were the true basis then sentences of 10 words more 
than the arithmetic mean would be as likely to occur as sentences of 10 words 
less, and it is easy to see that this is not the case. 

Before the whole theory of the use of such distributions for separating works 
of different authorships can be fully accepted it will of course be necessary to 
study the results obtained from many different works by the same author, in 
different styles, on different subjects and at different periods of his life. From 
these it may be possible to find what variation can occur “within authors” as 
oompared with “between authors”. This note is not meant to deal with this 
basic problem but only to draw attention to the simplification of the method 
of approach to such a problem by the use of a transformation which produces 
a normal instead of a skew distribution. 
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APPLICATIONS OF THE NON-CENTRAL ^-DISTRIBUTION 

By N. L. JOHNSON and B. L. WELCH 


1. Introduction 

Statistical problems arising in connexion with the normal distribution are 
simplified by the circumstance that the sample mean x and standard deviation 3 * 
are jointly sufficient estimates of the corresponding population parameters £ and 
tr, Also the distributions of x and s have simple forms and together with Student’s 
^-distribution provide complete solutions of any problems of testing hypotheses 
or of estimating fiducial limits relating to either £ or <r singly. In the present paper 
we shall consider some of the questions which arise when our main concern is not 
with either £ or a alone, but with some function of the two. In particular we shall 
consider a number of cases which all lead to the use of what has been called the 
non-central ^-distribution. Tables of the probability integral of this distribution 
will be given in a form suitable for the solution of the problems raised. 


2. The non-central ^distribution 

Let z be a quantity distributed normally about zero with unit standard 
deviation and let w be a quantity distributed independently as y 2 //, where/ is the 
number of degrees of freedom of the y 2 , Then if t is defined by the equation 

2 + J 


t- 




.( 1 ) 


where 8 is some constant, then t is distributed in a manner depending only on 8 
and/. This distribution will be termed a non-central ^-distribution. When 8 equals 
zero we have the familiar Student’s l. 

In general the elementary probability distribution of t is given by 

1 


P(0 


2 


r w (M L ,( 

/ \«/+i) 


li \2 
C(/H>)7 ( l v 


/! 


where 


Hh t (x) = 


l /£_ 

e ir+t 'l-r 


(?+TV 


Eh 


tt__ \ 


/! 


gi(-o+x)» fl Vt 


.( 2 ) 


.(3) 


An account of some of the properties of the above-defined Hh function has been 
given by R, A. Fisher (1931). He derives a result equivalent to the above equation 
(2) although with a different notation, his t being equivalent to our </// and his 

* s a = r(*-s) J /(n-l), where n is the sample size. 
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r being our £/./(/+1). Tables of the Hh function have been calculated by J. R. Airey 
(1931). These tables, however, are not useful for solving the pioblems which are 
considered in the present paper. For these problems the probability integral 
of the non-central ^distribution is required. Existent tables of this integral 
(J. Neyman, 1935 and J. Neyman & B. Tokarska, 1936) have been calculated only 
for one rather restricted purpose. In the present paper much more extensive 
tables will be provided which, it is hoped, will cover all the applications of the 
non-central f-distribution likely to be encountered. 

The probability integral of non-central t demands a table of triple entry, since 
the probability that t exceeds < 0 , say, depends on/, 8, and t 0 . The notations 

P(f,8,t 0 ) = P(t>t 0 \J,8) .(4) 

will be used to denote this probability. 

Often it is necessary to find what value of t 0 is such that the probability (4) 
will take a specified value e, say. This t 0 will be a function of/, 8, and e and it will 
be convenient to denote it by <(/, 8, e). Thus 

P{t>t(f,-8,e)\f,8} = e. 

Again, often will be given and the value of 8 which makes (4) take the value e, 
will be required. It will be convenient to denote this value of 8 by 8(f, t 0 , e). Thus 

P{i>t 0 \f,8(f,t 0 ,e)} = e. 

Space does not permit the tabling of all the three functions P(/, 8, i 0 ), t(f, 8,e) 
and 8(f , < 0 , e). For reasons which will become clear later in the paper it was decided 
to table 8(f > t 0 , e) most fully. Table IV, given at the end of the paper, facilitates the 
direct calculation of 8(f, t 0 , e) for seventeen probability levels e. It can also be 
used without difficulty for calculating <(/, 8, e). Table V is an additional short table 
from which t(f, 8, e) can be calculated rather more directly but only for the 
probability levels e = 0-05 and e = 0-95. These tables are given in the most suitable 
form which we have been able to evolve consistent with necessary limitations on 
size. 

Before the tables are described, the next three sections will be devoted to a 
description of some of the situations where they are required. 

3. Coefficient of vaeiation 

The first function of £ and a to be discussed will be the coefficient of variation 
V - <r/g. In the practical situations where this index is an appropriate measure 
of variability, the variable x is usually necessarily positive. Now for a normal 
population the ratio of the mean to the standard deviation has to be of the order 
of 3 or more, for the chance of a negative x to be negligible. Strictly speaking, 
therefore, the sampled population should not be assumed normal if the coefficient 
of variation is too great. The figure 33 % has often been stated as the permissible 
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upper limit. In practice coefficients of variation are usually much smaller than 
this, and it is assumed in the present section that we are dealing with such cases. 

An estimate of V is provided by the sample coefficient of variation v = sjx. 
Now since we may write 

fn __ fn x = l-J njx- ij) + fnj\ ^s_ 
v s { cr (r j ' cr . 


it appears from comparison with (1) that fn/v is distributed as non-central l with 
/ = ( n ~ 1) and 8 = fn/V (McKay, 1032). The solution of problems relating to V 
is therefore easily effected. 

For instance, suppose it is desired to test whether the sample contradicts the 
hypothesis that F< F 0 , and that we decide to reject the hypothesis when v > v n> 
where v 0 is chosen so that P(v > v 0 ] F = F 0 ) is equal to some specified small chance 
e. In the notation of the previous section v 0 will then be given by 


Jn 


t\n- 


1 & T 

’F 0 S 


Again, consider an example where it is decided to reject a sample as unsatis¬ 
factory when v is greater than a given value v 0 , and where it is required to know 
how low the true coefficient of variability should be kept to ensure that the 
probability of rejection will not exceed a given e, i.e. we require F 0 such that 
P(v > v 0 1 F = F 0 ) = e. In the notation of Section 2, F 0 is given by 




~8 




( 7 ) 


Or finally suppose that a value of v is observed and an upper fiducial limit of 
F is required so that the chance is e of this limit being exceeded, i.e. a lower 
fiducial limit of fn/V is required. Now since 

(V m . 




l - 7 

1 , y , 


= 6 


and since the inequality 


fn 


>t\n 


v ’ 


* 1, rr , e 


is equivalent to the inequality* 


fn 

V 




J 


it is seen that the required upper fiducial limit of F is 



(8) 


* This follows from the fact that t[f, S, e) is a monotonioally increasing function of <1. 






N. L. Johnson and B. L. Weloi-i 


365 


4. The power of Student’s /-test 

Suppose it is desired to test the hypothesis H 0 that the mean £ of a normal 
population has the value £ 0 . Student’s /-test consists in calculating from the data 
the quantity (= >(»-{,) . 

«s 

and referring it to the usual central /-distribution with /= (n-1) degrees of 
freedom. Thus if the only alternative hypotheses to he taken into consideration 
are those for which £ > £ 0 , the test will consist in rejecting the hypothesis when 
t>t Q , where t a is such that P(t > /„ \ H n ) = e, and e is the conventional level of 

significance. In our notation _ 

< 0 =/(w-1, 0, e). .(10) 

Now when H 0 is not true, hut £ has some alternative value £ 1; it is often required 
to know how powerful Student’s test will be, i.e. what chance the test will have of 
rejecting H 0 . But we can write (9) in the form 

t _ ! >(*-&) , >(gi-6> )j. * .(11) 

whence, comparing with (1), it now appears that the quantity calculated from the 
data is distributed in the non-central /-distribution with / = (»-1) and 

s = (£i~£o )l cr - 

The power of the test is therefore P(n- 1» V 71 (Si ~^l cr >h) an( ^ va ' UR 
for which the power reaches any specified value ?/, say, is given by 

. ( 12 ) 

cr 

Tables for evaluating the power of the /-test together with a discussion of their use 
have been given by J. Neyman (1935) and J. Neyman & B. Tokarska (1936). 
They are not restricted to the simple case of testing whether a mean has a 
specified value but apply to all cases in which the t- test is used. 


5. Proportion defective problems 

Another class of problem where the non-central /-distribution has an applica¬ 
tion, occurs when objects are classified as defective or non-defective according to 
whether they have values of a characteristic exceeding or falling short of a fixed 
standard. Thus, if an object is defective when the character x exceedsa fixed given 
level L, then information will often be required about the proportion P falling 
beyond L in the population. If the population be normal P clearly depends only 


on the ratio 


{L-Q 
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An estimate of U is provided from the sample by calculating 

(L-x) 

u = ---. 

s 


(13) 


Corresponding to deviate u the equivalent normal probability p will then give 
an estimate of P. This situation has been discussed recently by W. J. Jennett & 
B. L. Welch (1939).* The transference from V to P and front u to p does of course 
require that the sampled distribution is really approximately normal and for 
this reason care must be taken ingoing out to the “tails” of the distribution 
(i.e. when U is large and P small). 

In order to allow for the sampling errors arising from this method of estimating 
P we have only to note that (13) may be written 


<]nu = 


\fn (L- g) _ fn (5 -g)' 
cr cr 


s 

cr' 


(14) 


Hence fnu is distributed as non-central t with / = (n- 1) and 8 = fn U. Thus if 
the proportion P (and hence U) wore known the value w 0 of u such that P [u > u 0 ) = e 

would be given by _ 

Jn u 0 = t(n-l,JnU,e). .(15) 


Conversely, given u from a sample, a lower fiducial limit of U is obtained by 

noting that the inequality _ 

fnu>t(n-l,jnU,e) 

is equivalent to the inequality 

fnU <8(n-l,<Jnu,e). .(16) 

Hence a lower fiducial limit for U will be given by 8 (n - 1, fn u, e)j*Jn. In the above 
analysis the proportion falling beyond L has been termed a “proportion de¬ 
fective”. In industrial problems this is often a fair description. More generally 
the analysis refers to any problem where we are primarily interested in how a 
measurable object is related to a fixed level, A slightly different problem occurs 
when we are not given L but are asked to estimate the value of L so that P will 
have an assigned value. Bor instance, the value of L may be required so that the 
chance is 1 in 20 of it being exceeded. 

Now if U is the normal deviate which is exceeded with probability P, then the 
required L is related to £ and cr by the relation 


L = £+ Vc r. 

An estimate of L is therefore given by 

l-x+Us. .(17) 

* For a general discussion of similar problems arising in the control of industrial products the 
reader may be referred to the British Standards Institution Publication, No. 600 (E. S. Pearson, 
1935). 
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If fiducial limits are required for L they may be obtained by returning to equations 
(13) and (15). L is not now given, but U is definitely known and therefore (15) 
provides the value of u 0 such that P(u > u 0 ) = e. But the inequality 

L-x 

u =-> u n 

s u 

can be reversed to read L > x + u 0 s. 

Hence, if we write l e = x + S —— * >"J n e) 

V w . v 

we shall have P{L>l e ) = e. Hence l e will be a fiducial limit for L, which will be 
exceeded by L in a proportion e of cases. 


6. Large sample results 


Before going on to discuss the distribution of non-central t in general and the 
method of applying the tables given below, it will be convenient to consider the 
situation when/is large. The distribution then approaches the normal form. The 
important question is how quickly. 

The first three moments of t are given exactly by the following expressions: 



/6 2 — 


/(1 + * 2 ) 
(/-2) 


-Pi> 


f l 3 — A 6 ! 


7(2/-3+ S 2 ) 
,(/ : 2)(/-3) 



(19) 


.( 20 ) 


( 21 ) 


If the gamma functions are expanded in powers of 1// and only the leading 
terms are retained, these give approximately 

~ 55 2 1 

u 


p'i - 8] H = 1 + 


8 1 

' 2 / 


sr 5^1 

/is=Z fL + 1/ ; ^ x ~ 


3-b 


4/. 


L i + 2/. 


"IS 


.( 22 )’* 


In large samples, therefore, t becomes normally distributed about 6 with standard 


deviation 11 + , The rapidity of the approach to symmetry is indicated by the 

true values of in Table I. For given / the greatest value can take is shown 

in the last column. 

* The term <5 2 /'/ is retained because in most problems 5 will be of the order //. 


24-2 
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TABLE I 

VAi °f t f or different f and, 8 


\j/df 

f\ 

\ 

0 

1 

2 

3 

00 

4 

0-00 

3-70 

4-61 

4-86 

(5-09 

6 

0-00 

1-67 

2-13 

2-26 

2-38 

8 

0-00 

1-20 

1-55 

1-65 

1-74 

12 

ooo 

084 

MO 

1-18 

1-25 

24 

(MX) 

053 

0-69 

0'74 

0-79 

00 

0-00 

0-00 

0-00 

0-00 

OOO 


It is seen from Table I that values of ffi of the order of unity are likely to 
occur fairly frequently in the type of problem which has been discussed in the 
previous sections. This represents a considerable degree of skewness and another 
method of making normal approximations, outlined below, is preferable. Before 
describing this we may note that if t is referred to a normal scale with mean 8 


and standard deviation 



then tire following approximations to the 


quantities defined in section 2 will be obtained: 


t{fA,e) = 8+K 




.(23) 


where K e is the deviate of the unit normal curve exceeded with a probability e. 
Also 8{f, t 0 , e) will be given by the solution of the equation in 8 


t 0 = 8+K 



(24) 


On rationalization this equation is quadratic in 8 and the correct root to take will 
be obvious. 

Better approximations, which are, however, still based on the normal ourve 
may be obtained as follows. The probability that ( exceeds a given value < 0 is 
by (1) the probability of the inequality 


t - + > t 

Jw >to ’ 


.(25) 


and this inequality is equivalent to 


{~z + t 0 fw)<8. . ( 26 ) 

Now (- z) is a unit normal deviate, and *Jw, being of the form x/ff is very nearly 
normally distributed even for small/. Since z and fw are independent, (-z + t 0 fw) 
must therefore be more nearly normally distributed than fw, whatever tf 0 .* 
* This is practically obvious, but a demonstration is given in tha Appendix, equation (43). 
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Hence an upper limit to the skewness of (- z + t n 4 w ) is given for different/by the 
values of 4k in Table II, Comparison with Table I shows that it is better to take 
(~z + t 0 ^jw) as normally distributed rather than t. The procedure is then as follows: 


Write 


mean^/w = a; a^ w = 


V(2/r 


.(27) 


Then (- z +1 0 *Jw) is taken to be normally distributed with mean at 0 and standard 
deviation 1 + ~ j . Tor given t Q the value of 8 such that P{t>t 0 ) equals e is given 
by seeking 8 such that the probability is e of (26) being true. This gives 


<*(/, h, e) = at a -K e 



(28) 


TABLE II 

4k of 4 W f or different f 


/ 

4 

6 

8 

12 

24 

00 

4h 

0-41 

0-32 

0-27 

0-21 

0-15 

0-00 


Conversely for given 8 the value t(f, 8, e) which will be exceeded with probability 
e, will be approximated by solving for t the equation 

8=.at-K £ {l+°-k} . .(29) 

On rationalization this becomes a quadratic in t. This method of approximation 
was given by W. J. Jennett & B. L. Welch (1939) together with a short table of 
a and b. However, since a and b differ from unity by a quantity of the order 1// 
and since errors at least of this magnitude are involved in the assumption of 
normality, it would perhaps have been as logical to take a and b equal to unity 
in the approximation. The numerical comparisons set out in Table III indicate 
that nothing is lost by doing this and also show how inferior is the method of 
approximation which assumes t itself to be normally distributed. 

Therefore the approximation which we shall adopt as being the best one, 
requiring only the use of the normal probability scale, will consist in assuming 
(~z+t 0 fvj) to be normally distributed about t 0 with standard deviation 



The value of 8 such that p(t > t 0 ) equals e will then be approximated by 
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TABLE III. 


Values of Sjff 


I 

e 

^0 

dr 

<W/ 

True. 

Approx. 1 

Approx. 2 

Approx. 3 

4 

0-99 

2 

0-05B 

0-736 

-0-088 

-0-015 

4 

0-01 

2 

3-972 

12-023 

3-848 

4-015 

4 

0-99 

5 

1-120 

2-008 

0-505 

0-726 

4 

0-01 

0 

9-247 

29-230 

8-840 

9-274 

24 

0-99 

2 

1-181 

1-345 

0-950 

1-178 

24 

0-01 

2 

2-819 

3-103 

2-800 

2-822 

24 

0-99 

6 

3-290 

3-677 

3-213 

3-255 

24 

0-01 

5 

0-752 

7-593 

0-087 

0-745 


The values correspond to certain /, e and Ijjf and have the property that p{l>t 0 \f, S) = e. 
Approximation 1 is the value obtained from equation (23), approximation 2 by using (28) with the 
correct a and l> and approximation 3 by using (30), i.o. taking a =» b = 1. 


Conversely, for given 8, the value <(/, 8, (■:) which will he exceeded with probability 
e will be given implicitly by 


8 = 



.(81) 


which on rationalizing and solving becomes 



(32) 


The approximations (30) and (32) are useful even in moderate-sized samples. 
It is not intended, however, to enter here into any detailed discussion of how good 
the various approximations are, They have been considered primarily because 
they form the basis of the exact tables which are given at the end of the paper and 
which are described in the next section. 


7, Description and use of tables 

The tables are intended to facilitate the exact calculation of the quantities 
S(f, < 0 , e) and <(/, 8 , e) defined in Section 2. The form in which they are presented 
was determined largely by the fact that the space they had to occupy was limited, 
and also to some extent by the method which was adopted to calculate them. 
Table IV may be used to give 8(f, t 0 , e ) for 17 probability levels, viz. 

e = 0-005,0-01,0-025,0-05, 0-1, 0-2, 0-3, 0-4, 0-5, 0-6, 0-7, 

0-8,0-9,0-05,0-976,0-99 and 0-995. 





N. L. Johnson and B. L. Welch 371 


The basis of the table is the large sample approximation described in the last 
section, equation (30). The K s in this equation is simply the normal multiple 
which is exceeded with probability e. In small samples an error will be committed 
in taking this form of approximation, and in Table IV we give instead a multiple 
A (/, if 0 , e), which is such that exactly 

8(f> ^0 ) £ ) = h — Mf> ^o> e ) ' .(33) 

Even for samples giving/ as small as 4, it is found that A never differs much from 
K e . This stability makes interpolation easy so that, for given e, it is possible to 
provide a table of small compass covering all values of i 0 from — oo to oo. 

For given e the tabled quantity A is a function of / and < 0 an d accordingly 
Table IV, for given e, is one of double entry. 

Values are given corresponding to / = 4,5, 6, 7, 8, 9, 16, 36, 144 and oo. The 
reason for choosing these values of / is that A differs from K e by a quantity of the 
order 1 ///. The values/ = 9,16,36,144 and oo are such that 12/// = 4,3,2,1 andO 
respectively. Hence interpolations for intermediate/’s may be simply effected by 
considering A as a function of 12/,//, since we are then dealing with a function 
tabled at equal intervals. 

The choice of the values of t 0 in the table is also determined by the necessity 
for interpolation to be simple and yet the table not to be unduly large. The 
whole range of t 0 from - oo to oo has to be covered. This has been done as follows. 
For i 0 //2/ between ~ co and — 0-75, A is given against the quantity 


y = 



.(34) 


For t 0 //2/ between — 0-75 and 0-75, A is tabled against 



For f 0 //2/between 0-76 and oo, A is again tabled against y. The argument interval 
for both y and y' is 0T, It will be noted that y -{l -«/' 2 )*. 

From the point of view of interpolation other, perhaps simpler, functions of 
tg could have been used in constructing the table. The choice of y was ihade 


because the quantity 


i+5 

v 


is wanted in any case for substitution into equation 


(33) after A has been obtained, y' had to be used in addition because of inter 1 - 
polation difficulties in the middle of the table. 

It will be noted that in Table IV only the double entry tables fore = 0-005,0-01, 
0-025,0-05, 0-1, 0-2,0-3,0-4 and 0-5 are given. For e > 0-5 we can use the fact that 


m o,e) = -3(/,-<o,l-e), .( 36 ) 


a relation which is apparent from the form of equation (1). 
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To summarize, the steps necessary to find the value of S such that P(l >t B ) = a 
are as follows. 


(i) Finely = (1 + ^) ' or 


y pA w 


according as 


PI 


is greater than 


or less than 0-75. 

(ii) If/> 9 calculate 12/ff. 

(iii) Ifeis one of the values 0-005,0-5, enter the appropriate part of Table IV 
and obtain A (f,t 0 ,e), by interpolating with respect to the quantities obtained in 

(i) and (ii). 


(iv) Calculate 


S{f,l 0 ,c) ~ t Q -^ 1 + 




w • 


(v) Ifeis one of the values 0-0,0-7,..., 0-!)f)5 calculate 8{J, ~t 0 , 1 - e) arid then 
change its sign. 

Inverse use, of Table IV 


Next consider the calculation of t(f, d, e), i.e. the situation whore d is given and 
f 0 is required so that P(t > < 0 ) = o, In the previous section it was shown that a 
first approximation will be given by equation (32). The true relation between 
t(f, 8, e ) and d will, however, be obtained by replacing K t in (32) by the A which is 
tabled in Table TV; thus 





.(37) 


The drawback about this equation is that A is given in Table IV as a function of 
t B , and t 0 is not now known. An iteration method is therefore necessary, 

The successive steps in this iteration may be summarized as follows: 

(i) The first approximation q is given by (32); thus 




(ii) Next use Table IV to calculate directly 


Ai= A(/,q,e). 

(iii) A second approximation l 2 is then given by substituting this A x in (37); 
thus 
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(iv) Then by finding A 2 = A(/, < 2) e) and substituting in (37) a third approxima¬ 
tion < 3 will be obtained. These steps must be repeated until two successive 
approximations give the same value. In practice this is found to occur very 
quickly so that there is likely to be no need of more than three approximations. 

(v) If e is one of the values O'6, 0-7,..., 0-995 calculate^/, - 8, 1 -e) and then 
change its sign. 

Use of Table V 

The iteration process described above is not very lengthy in actual practice, 
but even this amount of trouble would be unnecessary if A were tabled as a function 
of 8 as well as a function of t . Such a table, similar in extent to Table IV, could of 
course be calculated, but it did not seem worth while to do this, since the inverse 
method of using Table IV is not difficult. However, it has been thought useful to 
give for the single probability level e = 0-05, a table which can be entered with 
8, since very often this conventional 1 in 20 chance is taken as a point of reference 
in statistical problems. 

In Table V, A is tabled against 



at intervals of 0-1 from -1 to 1. When 8 is given, this function of <5 may be quickly 
calculated and then A may be obtained by interpolating in Table V. The substitu¬ 
tion of this value of A in (37) gives the required t(f, 8, 0'05). As before t(f, 8, 0-95) 
may also be calculated from the same table, being equal to minus t(f, -8, 0-05). 


8. Examples 

In the present section some numerical examples will be worked to illustrate 
the application of Tables IV and V. Reference will be made to theoretical results 
obtained in Sections 3-5. 

Example 1. In a sample of n = 25 a coefficient of variation v = 2 -6 is observed. 
Obtain from this a “median” estimate of the population coefficient V. By a 
“median” estimate is meant one for which the chance of it exceeding the true 
V is 0-50. In other words our estimate is the 50 % fiducial limit. Hence from (8) 
the required estimate is 

V25WJo.O'5), .(39) 

where = 10231 ancl 24, « 

H )‘=" 03,91 ^wiH)^ 0 ' 2674 - 

12/^//= 24495. 


We have 
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From Table IV, for e = 0-5, we obtain by linear interpolation* with respect to 
y' and 12/ff, A = 0-0197. Hence from (33) 

5 = 1-0281-(0-0197) (1-0370) = 1-9027. 

Therefore from (39), v m - 2-628. 

Example 2. A sample of n = 10 measurements of a normally distributed 
character x is given. The mean x is 4-7 and the standard deviation s is 0-2. Obtain 
10 % fiducial limits for the proportion P in the population exceeding x = 5-0. 

As is shown in Section 5 this is equivalent to finding limits for U = (5*0 - £)/<r. 
We calculate first u - (5*0-5:)/s = 1*6. Substituting this into (16) the upper and 
lower 10 % points for U are given by <5(9,4*743,0*9)/ A /10 and 5(9,4*743,0-l)/ A /10. 


We have 


1 + 


4 

Vi 


1 * 6 ; 



0-6667. 


From Table IV for e = 0-1 we obtain by linear interpolation* 

A(9,4*743,0*1) = 1*347, 

and hence from (33) 

<5(9,4*743,(1-1) = 4*743-(1-347) (0*6667) - 2-723. 


To obtain <5(9,4*743,0-9) we note by (36) that this is equivalent to minus 
<5(9, -4*743,0-1). The corresponding A is obtained by entering the same table as 
before, with the same value of y, but in the part of the table for which 4 0 is negative. 

We obtain A(9, -4-743,0-1) = 1*197. 


Hence <5(9, -4*743,0*1)= -4*743-(1*197) (0*0007)= -6*538, 


and therefore 5(9,4*743,0-9) = 6*538. 

The upper and lower 10 % limits for U are therefore 6-538/^/10 and 2*723/^/10, 
i.o. 2*068 and 0*861. The corresponding limits for P are 0*019 and 0*195. 

Example 3. Suppose it is required to estimate fiducial limits for the value L 
of x in the normal population which is such that tho proportion of the population 
exceeding L is 10 %. Consider tho ease where the sample size is n ~ 10 and the 
limits required are the 5 % limits at each end. From (18), we require 


where 


x+k 0 . 05 s and a + fco-os'-'O 
, 4(9,^1017,6) 

V 10 ’ 


(40) 


and U is the normal deviate exceeded with probability 0*1; i.e. U - 1*2816. We 
therefore require f(9,4*053,0*05) and 4(9,4*053,0*65). In the first place these will 
be derived from Table IV by the inverse method described above. 


* For a note on the accuracy of linear interpolation sec the end of Section 9. 
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Following out the successive steps to derive i(9,4*053,0-05) we obtain 
4-053 + (1-6449) (1 + 0-9126-0-1503)* 


(i) t x = 


(1-0-1603) 


7-340, 


(ii) y x - (i + |j * = 0-5004; A t = A(9, 7-340, 0-05) = 1-6803, 

,«« , 4-063 + (1-6803) (1 + 0-9126-0-1569)* „ 

(m) h = (I^0T569)--- = 7 448) 

(iv) y t = jl +|j 3 ' = 0-4950; A 2 = A(9, 7-448, 0-05) = 1-6800, 

1 3 = 7-447. 

To obtain f(9,4-053,0-95) we require first 1(9, -4-053,0-05). The successive 
approximations to this quantity are 


- 4-053 + (1-6449) (1 + 0-9126 - 0*1503)* 
(1-0-1503) 


2 - 200 , 


(ii) y[ = ^j( 1 + !) i= - 04602 ; A i= A ( 9 > - 2,200 > °-° 6 )" 1>5933 > 

(iii) 1 2 =-2-249, 

(iv) y' t - JjL(l +^J~* = -0-4684; A 2 = A(9, -2-249, 0-05) - 1-5925, 
1 3 =- 2-250; 


1(9,4-053,0-95) is equal to minus 1(9, - 4-053, 0-05) and therefore equals 2-250. 
Hence from (40) 

7-447 , 2-250 

~ 2-3 ^ 9 ’ K‘%- ^ 10 =o-7i-. 

The above calculations have been performed using Table IV and the inverse 
method described in the previous section. Actually this example could have been 
treated much more simply by Table V. We shall proceed to show this, but it 
must be remembered that Table V only covers cases where e = 0-05 and e = 0-95. 
For other probability levels Table IV will have to be used. 

To obtain 1(9,4-053,0-05) and 1(9, - 4-053,0-05) Table V has to be entered with 

R / JMS \ —i 

n'= - 1 + -J , where A has the two values + 4-053, i.e. has to be entered with 

m\ -v/ 

ri' = ± 0-6908. These give values of A equal to 1-6800 and 1-5925 and are the same 
as the final A’s obtained in the iterations. We have then only to perform the 
calculations for 1 by substituting in (37). 
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0. Summary 


An account of some of the applications of the non-central ^-distribution* has 
been given, Tables of the distribution have been provided together with numerical 
examples of their use. 

If f 0 and S are connected by the equation P(t>t 0 \8) = e, then it was found that 
a good approximation to 8, given t 0 , is 

»-v-*,(i+! )*, 

where K, is the normal deviate exceeded with probability e. The equivalent 
approximation 



is good for calculating < 0 given 8. These approximations were seen to have ad¬ 
vantages over other approximations also based only on the normal distribution, 
Tables IV and V at the end of the paper make possible a more exact determina¬ 
tion of 8 given t 0 and t 0 given 8, respectively. In these tables a quantity A„ is given 
such that 


and so 




i+A,(l + ^- 

A?y 

/, A h 





Note on the accuracy of the tables. In the greater part of the tables A„ is 
correct to as many figures as are shown, Occasionally, however, the values of 
may be almost 2 units wrong in the last figure, It was nevertheless con¬ 
sidered worth while to give all the figures shown. 

Throughout Table IV linear interpolation will always give a result not more 
than a \ unit wrong in the second last figure, This is also true of Table V, 
except between y' = -08 and - l'O for/ = 9 and 16. Here linear interpolation 
with respect to y' may be 1 unit wrong in the second last figure. 

* i=(z-fS)/v'w, where «is a unit normal deviate, w is distributed independently as xVf> an( l / 
is the number of degrees of freedom of x i ■ 
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TABLE IV ( continued ) 
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TABLE IV ( continued) 
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TABTjE IV (continued) 
Values of A(/, t a , e) 
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TABUS IV {continued) 
Values of A(/, t 0 , e) 
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TABLE IV ( continued ) 
Values of A(/. t a , e) 
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Applications of the. non-central t-distribution 


TABLE V 

Values of A as a function of 8 

c--=0*05 


\ / 

4 

6 

(> 

7 

8 

0 

16 

36 

144 

CO 

m 

1*630 

1-048 

1*656 

1-660 

1*662 


1*0665 

1*6634 

1-0559 

1*6449 

0*9 

1*043 

1-665 

1*662 

1-006 

1*608 

1*6695 

1*6711 

1*6667 

1-0576 


0*8 

1*650 

1-062 

1*668 

1-072 

1*074 

1*6747 

1*6751 

1*6091 

1-6580 

1*6449 

0‘7 

1*657 

1-688 

1*074 

1*677 

1*679 

■C4! il 

1*6782 

1*6707 

1*6589 

1*0449 

0*6 

1*664 

1*675 

1*680 

1*682 

1*684 

IE ft 

1*6804 

1*6714 

1*0587 

1*6449 

0*5 

1*671 

1*681 

1*086 

1*687 

1*687 

1-6871 

1*6817 

1*0709 

1*0580 

1*0449 

0-4 

1*679 

1*687 

1*690 

1*691 

1*601 

1-0896 

1*6816 

1*6008 

1*0508 

1*6449 

0*3 

1*687 

1*693 

1*694 

1*093 

1*692 

1-6902 

1*6804 

1*6677 

1*0550 

1*0449 

0*2 

1*693 

1*697 

1*696 

1*694 

1*692 

1*0898 

1*6779 

1*0646 

1*6529 

1*0449 

0*1 

1*698 

1*699 

1*097 

1*693 

1*600 


1*6738 

1*0606 

1*6504 

1*6449 

O'O 

1*703 

1*699 

1*695 


1*680 


1*6682 

1*6558 

1*6477 

1*0449 

- 0*1 

1*702 

1*095 

1*080 

1*084 

1*079 

1*6756 

1*6611 

1*6503 

1 * 6*147 

1*0449 

-0-2 

1*608 

1*088 


1*074 



1*6525 

1-0442 

1*6417 

1*6449 


1-687 

1*676 

1*007 

1*661 


1*0535 

1*6427 

1-0378 

1*0388 

1*0449 

- 0*4 

agiwti 

1*058 

1*050 

1*646 

1*642 

1*6391 

1*6322 

1*6314 

1-0359 

1*6449 

- 0*5 

■ELI 

1*636 


1*627 

1*024 

1*0231 

1*6213 

1*0252 

1*6334 

1*0449 

- 0*6 

1*616 

1*610 

1*007 

1*006 

■SMl 


1*6108 

1*0195 

1*0313 

1*6449 

- 0-7 

1*582 

1*583 

1*685 

1*587 

1*680 

1*5911 

1-6019 

1*0150 

1*0299 


- 0*8 

1*551 

1*559 

1-605 

1*571 

1*575 

1*5702 

1-6954 

1*6122 

1-6291 

1*6449 

- 0-9 

1*531 

1*544 

1-553 

1*501 

1*567 

1*5722 

1-5925 

1*6110 

1*0292 

1*6449 

-1*0 

1*528 

1*543 

1-654 

1*563 

1*569 

1*5744 

1*5952 

1*6141 

1*0307 

1*6440 



For note on nocuracy of table see end of Section 9, 


APPENDIX 

On the. calculation of Table IV 

Before deciding on the method to bo employed in calculating the tables a wide variety of 
methods wore considered. Wo examined those with a view to finding one which would 
best combine the attributes of speediness, lack of opportunity for numerical errors, and 
adaptability to moderately large-scale‘work. In this appendix will bo described the more 
promising of the methods tested, including the one finally adopted, 

The non-central f being defined as in (1) we have for the probability that t>t 0 , with the 
notation of Section 2, 

'**» - . <"» 

v7 

Wo want to find pairs of values 8 and f 0 such that P(f, 8, <„) has specified values e; i.e, we 
want to evaluate the quantities <„(/, 8, e ) and 8{f,e). 
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(i) Direct Methods 

These are based on the idea of evaluating the right-hand side of (41) direotly for various 
values of 8, /, and t 0 and then obtaining the quantities required by a process of inverse 
interpolation. An obvious way to evaluate P(f, 8, Q is by quadrature. For small even values 
of/, however, the following formulae: 


P(0<t<co|/, 


1 C a 


e-W-W-du, 


K/-2) 




1 f 00 

whero Hh,lx) = -\ u’e-^^du 

*U 0 , 

were found to be more convenient to use. The Hh, functions have been tabled by J. R. Airey 
(1931). 

This modified procedure was still unduly lengthy and not well adapted to large scale 
work. It provides, however, a useful cheek on values obtained by other methods. 


(ii) Solution by means of a differential equation 
For fixed values of/ and e the equation P(f, 8, t a ) - e may be regarded as specifying 8 and 
4 as implicit functions of each other. This implies that for constant e, 


dto _ _ f 8P(/,J,*,) / 8P(f,8,t a ) \ 
d8 ~ \ 88 / 0t u j 

From (41) this is found to give 

1 -- r I. \ Hhf-li®) 

where JhK&T 

It may be shown that J,(x) = fJ](x) - xJ f (x) -1. 


.(42) 


. m 


For given values of/ and e, the value of 4 corresponding to 8 = 0 can be found in a straight¬ 
forward manner since it is simply a probability level of Student’s t. Starting from this 
initial value we can solve the differential equation (42) numerically and obtain the values 
of < 0 corresponding to non-zero values of 8. A trial of this method was made, taking / = 6, 
and e = 0-05. The differential equation was solved by the Adams-Bashforth process. The 
values of higher derivatives of t with respect to 8 at the point 5 = 0 are required in this 
process. By means of (43) the formulae for these quantities were obtained in quite a simple 
form. 

This method gave promising results—it was suitable for large-scale work and there was 
comparatively little chance of inaccuracies entering the work—but it suffered from the 
defect that we could proceed to build up our table by small increments only of 8 (actually 
increments of 0-05). As we wanted to be able to deal with large as well as small values of 8 
we decided that the process was not sufficiently speedy for our purposes. 

It may be remarked, however, that a similar type of approach might be quite suitable in 
cases where the expression for the differential coefficient is simpler than was ours. 
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Applications of the non-central t-distributinn 


(Hi) Solution based on normal approximation 

The method finally adopted for calculating tho greater part of the table is based on the 
use of Edgeworth’s development of the normal probability function. In a recent paper 
E. A. Cornish & R. A, Fisher (1937) have given formulae, derived from this development, 
which make it possible to find percentage levels of a probability function when the oumulunto 
of tho function are given. These formulae arc of most practical use when the probability 
function is itself nearly normal in form. With markedly non-normal distributions, the 
development may not converge, or at least more terms may bo required than is practically 
convenient, 

The cumulants of the non-central /-distribution are not difficult to find, Tho first three are 
given in equations (19), (20) and (21) and tho values of for different/and 8 are given in 
Tablo I. It was pointed out in the discussion of Section 6 that tho non-central /-distribution 
would often he markedly skew. It would appeuT, therefore, that it may not be a suitable 
distribution to express by a development of the normal function. Fortunately it is possible, 
as is pointed out in equation (20), to express problems domanding tiro quantities /(/, 8, c) and 
8(f,t 0 ,e) in a form such that tho solution ean be based on a distribution much more nearly 
normal than that of non-central /. 

It was shown that the statement 

P(t>t„\f,S)=:r. 

was equivalent to the statement 

P(Y <8\f,t 0 ) = <■:, 

where Y= (-z)-(-Z 0 y/~l. 

(Hero * is a unit normal deviate and % is distributed in tho ^-distribution with/ degrees of 
freedom.) The problem of finding fi(/,Z 0 ,e) is therefore equivalent to finding a percentage 
level of Y. 

The cumulants of Y may bo obtained immediately from the formulae 

MY) = xq(~*)T/-H* r (A:). .(44) 

In particular for r = 2, k\( Y) = 1 + /- l /Jx a (y), 

while for 2, x r (Y) =/' ,r /f,x r (y). 

Hence for all r tho shape coefficients y r ( Y) increase from 0 to y r (y) as Z 0 increases from 0 to co. 
They’s of Y are therefore always smaller than those ofy. Hut these latter are quite small oven 
for small/. Hence tho distribution of Y can never be far removed from normality. Tho distri¬ 
bution of Y is therefore suitable for development by Edgeworth's method and Cornish & 
Fisher’s formulae may conveniently be used to provide percentage levels. 

In this approach the problem of finding 8(f, /„, e) appears as more fundamental than that 
of finding /(/, 8, e). Hence Tablo IV is designed to facilitate the direct calculation of 8(f, Z 0 , e). 
Its use to provide /(/, 8, e) demands a process of iteration which is not however difficult. Tho 
actual quantity A(/, Z 0 , e) which is tabled differs by a small amount from K e , the normal 
deviate which is exceeded with probability e. This difference is given by the Gornish-Fishor 
formula. All the terms in this formula were used. This necessitated the use of the first six 
cumulants of y. A method of obtaining these has already been described in a previous note 
(N. L. Johnson & B, L. Welch, 1939). 
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'Die greater part of the computation of Tables IV and V has been the work of one of us 
(N, L. Johnson). The remainder and much subsidiary cheeking work is duo to Miss Catherine 
M. Thompson. We gratefully acknowledge her assistance. 
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MISCELLANEA 

(i) Stirling's formula with remainder* 

By E. V, HUNTINGTON, Harvard University 

Wo assume the following theorem us known: 

n! = -J(2iT)Jnn n er"[e 1 '], 

whorea = 2, 3, 4,and 

n__L 2, 1 _L_ + _i_ 601 , 

~ 12n 360n 3 1200a 6 1680a 7 1188a 9 360360a 11 

= 0-083,333n -1 - 0-002,777n- a + 0-000,794a- 6 - 0-000,595n“ 7 

+ 0-000,842a" 9 - 0-001,918a" u +.... 

Wo assume also that tho error involved in stopping tho series at any point is loss (in 
absolute value) than tho first term dropped. (For proof, see, for example, E. B, Wilson, 
Advanced Calculus, p. 46(1.) 

Putting Q = e 1 ', and expanding and collecting terms, wo lind: 

n ! = J{2n) Jrin n e~ n [Q], 

, , 1 1 139 671 

12» 288»* 51,840a 3 2,488,320a* 

103,879 6,240,819 634,703,531 

+ 209,W8^880n 6 + 76,248,790,800a 11 902,961,601,000a 7 
4,483,131,259 , 432,201,921,612,371 

“ 80,684,309,913,600a 8 + 614,904,800,886,784, OOOn" + 

= 1 + 0-083,333,333a-H 0-003,472,222a" 8 

- 0-002,681,327n- 3 - 0-000,229,472a" 4 

+ 0-000,784,039a- 5 + 0-000,009,728n-» - 0-000,592,1 OOa" 7 

- 0-000,051,718a- 8 + 0-000,839,499a- 9 +.. „ 

(For the terms up to and including -a- 7 , compare, for example, H. T, Davis, Tables, 1, 180.) 

Let <2 2 = the sum of the Q -series up to and including the term in n~ 8 , Q 3 = the sum of the 
Q -series up to and including the term in n“ a , etc. 

Thon we find: 

Q = l + R v where 0-083,33a-* <Ji 1 < 0-086,07a" 1 . 

Q = Qi+K whore 0-002,07?r 8 <H a < 0-003,48a- # . 

QsQj + Hj, whore -0-002,80n- 3 <H 3 <-0-002,59a -3 . 

Q = Q 3 + R t , whero - 0-000,23a“ 1 <l? 4 < 0-000,18a~*. 

Q^Qi + Rs, where 0-000,6 Qn~ i <R t < 0-000,82a- 8 . 

Q~Q b + Rq, whore - 0-000,24a- s <H s < 0-000,07»- 8 . 

Q = + R v whore - 0-000,62a -7 <!?,<- 0-000,39a -7 . 

Q = Qi + R 3 , where-0-000,00a- 8 <1? 8 < 0-000,39a- 8 . 

Q = C, + 2 ? 5 , where -0-000,Ola -9 <H 9 < 0-000,88a -9 . 

* Presented to the American Mathematical Society, 8 April 1939. 
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(ii) A note on the interpretation of quasi-sufficiency 

By M. S. BARTLETT 

1. It has been shown by Fisher that if in problems of location we confine our attention 
to samples with tho same configuration G, where 0 denotes the differences between the 
observations, then no statistical information in his sense is lost whatever reasonable statistic 
T we choose os an estimate of the unknown parameter B. This result follows for probability 
laws of tho typo under consideration, 

p{x\Q)sf{x-6)dx, 

from tho relation p(S \ 0) = p(T j G, O)p(G), .(1) 

whoro /S' denotes the sample of observations. For the conditional statistic T | G-~ a random 
or statistical variable T varying for given or fixed C —I have when a relation like (1) exists 
lined the term quasi-sufficient, in view of the correspondence of (1) with the relation for a 
sufficient statistic T, 

p(S\d)=p(T\6)p(S\T). .(2) 

Conditional statistics also occur in the theory of mors than one unknown, when the fixing 
of some statistical variables has the effect of eliminating unwanted parameters from our 
distributions. Confining our attention, however, to problems which are primarily problems 
in one unknown only, we require to examine relations of the type (1) further, in view of some 
recent comments by Welch* on the extent to which any conditional statistic like T | 0 can 
claim to be sufficient, 

From Fisher’s theory of information, it follows that when estimating 9 from two or more 
samples, wc should naturally weight the different samples according to the information 
1(G) in each. But Welch has pointed out that if we are concerned with interval estimation 
of 0 from a single sample,- or with the most efficient statistical test associated with a value d 0 , 
it does not appear that all our information, in the widest sense, is retained in T \ 0, If, for 
example, the only alternative to 0„ happens to be 6, , it is known that the appropriate criterion 
for discriminating d 0 and isp(/S | d 1 )/p(/S' ( 0„). While from equation (1) it follows that this 
criterion is equivalent to p( T [ 0, 0 X ) /p( T | C, d 0 ), the former criterion must be referred to the 
distribution of S, not of S [ 0, 

In partial answer to this criticism, it might be noticed that when considering S\ C, we are 
not bound to ohoose tho same significance level a for each 0 observed. Suppose we choose 
a(0' 1 ) - e for the first sample. If the second sample gave a different configuration 0 2 , we 
might take «(G S ) such that the power of the test was a maximum for the two configurations 
Cj and O t if we had chosen a{ O x ) such that £{a{ 0-,) + a( 0 2 )} — e. Similarly for G s , and so on. 
In tho long run, the significance level adopted on this rule would be the average value 
E r {a r (G r )), where a r (O r ) denotes the significance level taken for the rth sample when making 
the rth tost. For any finite number of samples the average significance level- adopted is not 
exactly equal to e, but the level becomes equal to e in the limit; thus we could argue that, 
theoretically at least, wo should eventually reach the most powerful test for a given signi¬ 
ficance level merely from consideration of S | 0. In the above argument we allow the con¬ 
figuration distribution to be generated by the samples themselves. It is of course theoretical 
quibbling, for if we may assume that C has a definite distribution p(C) which is already 
known, we should use this fact rather than wait for its confirmation by repeated sampling. 

My own answer to the query of how far T | G can be considered sufficient is that it is only 
sufficient provided that we do really agree to consider samples with tho same configuration 
as that observed. The rejection altogether of the factor p(G) has two consequences, (i) we 
have seen that a test might conceivably be derived on the basis of p(C) more powerful than 

* B. L, Welch. Annals Math. Statist. 10 ( 1939 ), 58 - 69 . 





392 


Miscellanea 


fcho tost based on T \ G with fixed significance level a(G) = c, (iii) the fixing of G implies that 
whatever selection fof 0 there might bo distorting p(G), the validity of tests based on T | G 
will be unaffected. 

The independence of tho test from selection for 0 is tho oxplicit advantage gained, 
though in addition it is possiblo that if our specification of the population is erroneous, that 
it is less so for p[ T j G) than p(S); for example, if tho configuration 0 is similar to tho expected 
configuration on normal theory, it may provo more feasible to assume for a certain range of 
populations differing from normality that normal theory may still bo approximately sub¬ 
stituted for tho unknown probability p(T | 0) than that wo may do so for p(S) or p(T), 
though 1 have not investigated tins point. 

2, It is instructive to remoinber Fisher’s comparison of tno quasi-sufficiont statistic 
T | G with the sufficient statistic T, whore T may bo written in full, T | n, where n is tho size 
of tho sample. Tho equivalence of tho two types of statistic whon we regard 0 as fixed can 
be extended, in terms of our theoretical argument, if conversely wo assume that n had in 
some problem a definite and known distribution from sample to samplo. Consider similarly 
tho test of significance of a regression coefficient. Tho orthodox theory is to consider the 
conditional statistic b | S(x - ») 2 , where b is our ostimato, and —k) s the sum of squares of 
deviations of the independent variable x. Suppose for tho sake of argument that tho true 
variance of the residual dependent variable y x was known to be unity, and tho .r’s are such 
that S[x~xf- l on Mondays and 1*44 on Tuesdays, Then for an 0-025 significance level 
(one tail), the usual practice would bo to take 1-96 as tho significance level for b (from b 0 = 0) 
on Mondays, and 1-90/1*2 = 1-633 on Tuesdays. The power of the tost in relation to tho 
alternative that b l = 3-92 is 0-9860. But if wo were satisfied with adjusting the significance 
level to bo 0-025 merely in tho long run for Mondays and Tuesdays together, we may raise 
the power of tho test to its maximum value of 0-9878, by taking the Monday significance level 
at b = 1-87 (a = 0-0307) and the Tuesday level at b = 1-723 (a = ()•() 194). 


SUMMAttY 

In a, theoretical note on the interpretation of quasi-sufficient statistics in problems of one 
unknown parameter, it is agreed with Welch that full sufficiency properties can only bo 
claimed if we deliberately confine our attention to conditional samples of tho type observed; 
but it is pointed out that tho resulting inferences are independent of any suloction operating 
on the variables that have been fixed. 


(iii) The cumulants and moments of the binomial distribution, and the 
cumulants of x 2 for a (n x 2)-fold table 

By ,'J. B, S. H ALDAN IS , i'Ml.S. 

From the Department of Biometry , University College , London 

The first four cumulants of the distribution of y 2 for a (nx 2).fold table when samples are 
finite, have already been given (Haldane, 1937). These and higher cumulants and moments 
can be calculated by a simpler method. Consider a sample of s, tho probability of a success 
beingp, andp + q = 1. Pearson (1919) pointed out that for moments of (p + q) s about its mean, 
the generating function is (qe pt +pe- qi ) s , and Romanovsky (1923) gave a recurrence formula 
far the moments. That for tho cumulants is much simpler. 

Let JJ — qe pt pe~ Q K 
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or 


Then the cumulant -generating function 

m~ s i4 r =«iogi7. 

r= 2 

To find the cumulants for s = 1, we note that 

= ~l7 *’ 

3 P9(e vt -e- Ql ) 
~K(t) = - - - 

3 d 

So = pq~ K(t) +pqt, 

S K r t r ~ l ® f dK r 

Equating the coefficients of t r /r\, we find 

= P2. 
dK f 


and if r>2 

Lot pq = o, p — q = g; then 


*r+i = P7 


dq 


•( 1 ) 


..( 2 ) 


dK do dq „ , 

* r+1 = C ^’ ^ = !7 ’ * =1_4C> 

Hence if Ac ar =/(c), 

«tr-n = (/ G f'( G )> 

« 0(1— flo)/'(o) + c 2 (l-4c)/"<c).) 

From these equations we can very rapidly calculate successive values of K n sinco K a = c, 
and find: 

*h= 0, 

*3 = c, 


K s =*cg, 
k 4 = c — 6c 2 , 

* 5 = </(c — 12c 2 ), 

/C a = c— 30c 2 +120c 3 , 

K, = </(c - 60c 2 + 360c 3 ), 

/c 8 = c-126c 2 f 1,680c 3 -6,040c 4 , 
k, = g{o - 262c 2 + 5,040c 3 - 20,160c*), 

* l0 = c- 510c 2 + 17,640c 3 - 161,200c* + 362,880c 5 , 

Kn =g(c~ 1,020c 2 + 52,920c 3 - 604,800c* + 1,814,400c 5 ), 

K n = o- 2,046c 2 + 168,960c 3 - 3,160,080c* + 19,958,400c 5 - 39,916,800c 6 .(3) 

If each of the above cumulants be multiplied by s, the moments about the mean can now 
be calculated from the expressions given by Fisher (1928) and Haldane (1938). Iff = £ we 

^ ave K(t) = s log cosh \t, 

s s s 17s 31s _ 691s 

so , Ki=~-, K t =~, k * = ~1q’ K n~ 

while if q is very small we have for the cumulant-generating function of a Poisson series 

•K(i) = ac(e*-l-t). 
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Tho coefficient of e a in - [+ + (- 1 ) r - 3]. So whon q is small, but its square is not neglected, 
the first order correction to tho Poisson cumulant-gcnnrating function is 

K(t) = 8(0 -1 - i) + 8q 3 (c JI - e l ). 

Tho numerical coefficient of tho highost power of c in K r is (r-1)! whon r is oven, and 
l(r-l)! whon r is odd. 

Consider a sample of a, in which a successes are recorded. Then 


t = 


(«-3p) s 

m 


But a - sp is tho departure from the moan of tho binomial distribution [p + q)\ Hence tiro 
rth moment of the distribution of x i (for one degree of freedom) about zero, is 



where /i ir is the 2rth moment of (p + q)\ 

But if n' ( and k' bo tho rth moment about tho mean, and the rth cnrnulant, of tho y 3 
distribution, then 

H, ~ K - m'{, otc., 4 a /4, oto. 

Making tho necessary substitutions, wo find, for the oumulants of in terms of those, of 
tho binomial distribution: 

4 = (sc) -1 Kj, 

4 = (sc)-' s (2k‘ 1 + k 4 ), 

4 ~ H- 3 [8^+2(5^+6/r 1 /e 4 )+/r,], 

4 = {so)"*[48k;J + 48(5k,k!J + 3/rJ/Cj) + 8(4 k 4 + 7k 3 K 5 + 3k,k,) + k 8 ), 

4 as (sc) -<i [384/4 + 900(5/c|} rij + 24 k 4 ) + 80(25^ k 4 + 10s:, 4 + 28k 4 k 3 k 6 

+ fV'.j k 4 ) + 2(63k| + 100k 4 k,+ 60k 3 k 3 + 20K a Kg) + K 4 o), 
tf, = (w)-»[8,B40kS + 9,000(10*1 4 + 340 + 4,800(3^ + 25k, k^ 

+ 844+ k 5 + 240 4- 40( 1324 + 672k, k 4 k s + 189k, k, 

+ 226k, k„ + 300k, k 4 k 8 +1 80k, k, k, + 30k, k 8 ) + 4( 113/c* + 198 k, k 7 

+ 120k 4 k,+66k,k 0 +16k,k 10 ) + k 1! ). .(4). 

We now substitute tho values of k, given in equations (3) multiplied by s, putting 

k - (P(7) -1 = c" 1 . 

We therefore have, for the oumulants of x i with one dogroe of freedom: 

K l= = l, 

K,= 2 + (*-6)a~ l , 

k,= 8 + 2( 1 1* - 50) S' 1 + (ft 8 - 301b +120) s'*, 

K 4 - 48 + 96(4* - 19) 3~ l + 16(7* a ~ 126*4- 420) »--* + (* 3 - 1261:“ +1,680*-6,040)*-', 

K,~ 384 + 960(7*-32)a -1 + 400( 15* a -214* + 648)s~ a + 0(81* 8 

- 3,908* a + 38,420* - 98,496) a' 3 + (* 4 - 610* 3 + 17,640fc a 

-151,200* + 362,880) *-*, 

K,~ 3,840 + 9,600(13* - 58) a -1 + 9,600(26* a - 327* + 924) s -a 

+ 40(1,729* 3 - 66,236*3 + 459,024* -1,065,792) a -3 + 4(501** 

- 59,398*3 + 1,289,244** - 8,824,320* + 18,556,840) «-* 

+ (* 5 - 2,046** + 168,90O* S - 3,160,080** +19,958,400* 

-39,916,800) s~\ 


( 6 ) 
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When p = i, k = 4, and we have, for n degrees of freedom: 

*i = », 

/f a = 2ns _1 (s— I), 
x 3 = 8ns -s (s-1) (a-2), 

K t = 16ns _3 (.s--1) (3s 2 — 15s -|- 17), 

K t = I28ns"»(8 -])(«- 2) (3s 2 - 21s + 31). 

*,= 256ns- 6 (s-l)(15s 4 -210s 8 + 990s»-l,950s+l 1 382), .(6) 

If there are n samples, with different values of s, we have, for the cumulants of y 2 , where 

h = ~, and B, = I4r‘, 

2 pq 

Kx-n, 

k 3 = 2n[l + (4-3) Bj], 

K t ~ 4n[2 + {llh- 28) B, + (4 s - 164 + 30) BJ, 

t( i = 8a[6 + 12(84-19) B x + 4(144 2 -1254+210) B s + (4 s - 63 4 s + 4204 - 630) B,], 

k s ~ 16n[24+120(74-16) B 4 +100( 154 s -1074 +162) B a + 3(814 3 

- 1,9544 s + 9,5604 -12,312) B„ + (h* ~ 2554 3 + 4,4104 2 -18,9004 + 22,680) B 4 ], 
=.82«ri20 + 600(134-29) R l + 600(624 s - 3274 + 462) R 2 +10(1,7294 3 

- 28,1184 s +114,7664 -133,228) B 3 + 2(6014 4 - 29,6994 3 + 322,3114 s 

-1,103,0404 + 1,159,740) B 4 + (4 5 -1.0234 4 + 42,2404 3 - 395,0 1 04 s 

+ 1,247,4004-1,247,400) BJ.(7) 

When p = q = we have: 

K t = 2{n-Bx), 

/c 3 = 8(n~3B 1 + 4B,), 

x 4 = 10(3n~18E 1 +32B a -17B 3 ), 

k s = 128(3w - 30Bi + 100Bj - 135B 8 + 62E 4 ), 

= 256(16n — 225-Rj + 1,200B, - 2,940E 3 + 3,332B 4 - 1,382B 5 ).(8) 

The first four of equations (5, 6, 7, 8) have already been given in a slightly different form 
by Haldane (1937). The limiting forms of equations (5) and (7) when s tends to infinity and 
k to zero, while ks = g, have been given by Haldane (1938). However, the expression for 
there given is incorrect. The coefficient of in the expression for should be 124,800. 

The extension of equations (7) would be rather tedious. However, those of equations (6) 
and (8) would not be very difficult. The coefficient of x 2r j2r\ in the expansion of log cosh t is 
the value of (d/da:) 2r ' 1 ( 1 - tanb 2 x) when x = 0, and can easily be calculated, since this 
differential coefficient is a polynomial in tanhas. The equations for moments in terms of 
cumulants can easily be extended when all odd cumulants vanish. In this case a useful check 
can-bo obtained from the fact that when s == 2 the cumulant-generating function of y 2 
is i + log cosh f. 

SUMMAEY 

Expressions are obtained for the first twelve cumulants of the binomial distribution, and 
a simple recurrence formula for further cumulants. The first six cumulants of y 2 for a 
(n x 2)-fold table when expectations are small, are deduced. 
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CAUSATION AND CORRELATION 

(iv) The principles of the mathematical theory of correlation. By A. A. 
Tschopkow. Translated by M. Kantobowit, sen. Win. Hodge & Co. Ltd. Price 
12a-. I)d. 

This book is a translation of an enlarged reproduction of looturos delivered by Professor 
Tsohuprow at the University of Oslo, and originally printed in Gorman about a decade, 
ago. It will bo of interest to many people in that it is a complete survey of correlation 
theory and its underlying principles. Tho theory is expounded along the now familiar 
classical lines which had their origin in Karl Pearson’s writings, and although the appli¬ 
cation of this theory to practical problems is perhaps a little out of date, it still forms a 
nocossary background which tho student must, acquire, and of which a proper understanding 
will always bo essential. 

Tho book as a whole shows an astonishing “patchiness” in writing. It may bo the 
fault of the translator, but tho faot remains that tho meaning of whole paragraphs is 
sometimes vory obscure, while at other times tho oaso and lucidity with which arguments 
are presented aro unrivalled by any comparable treatise. This unevenness is unfortunate, 
since the book will bo road more for tho exposition of underlying principles than for its 
algebraic development of tiro theory; indeed it may bo questioned whether those who 
have not previously mastered tho elements of statistical theory and probability will obtain 
any profit from its reading. There is no clear discussion of probability and what it means, 
and the reader is loft to find out from examples how a probability may bo calculated. 
This cannot be deemed a fault, but seems to point to tho book being useful only if previous 
knowledge of tho subject is obtained elsewhere, 

Tho chapter on stochastic connexion and functional relationship is good, and may be 
read with advantage by any statistical worker and teacher. The method of approach is 
the same os that of the present day, and will doubtless bo followed for many years to 
come, It is, however, a little astonishing to find that tho translator makes no use of tho 
term "random variable", which has long passed into common use, Tho general discussion 
throughout the book and in particular in the chapter entitled "Object and value of Corre¬ 
lation Measurement” is stimulating, even if tho reader is not always in agreement witli tho 
author. The mathematical development of the theory is sot out with clarity and freshness, 
which make it enjoyable reading. The notation is a little cumbersome, but, once mastered, 
it does not prove difficult to follow. 

Taken as a whole, this book is a worthy contribution to correlation literature, and it is 
surprising that no translation has been published until this date. It certainly should be 
read by all who attempt to gain an understanding of statistical theory. 


F. N. DAVID, 




